Build the Body. Code the Brain.
Welcome to the world's first spec-driven course on Physical AI & Humanoid Robotics. This isn't just a textbook—it's your guide to building intelligent robots that perceive, think, and act in the real world.
Written by Sharmeen Fatima, this course bridges the gap between software engineering and embodied intelligence, teaching you how to create robots that don't just exist in code, but move, sense, and respond in physical spaces.
What You'll Learn
This textbook covers the complete robotics stack:
| Module | Focus | Key Technologies |
|---|---|---|
| Module 1 | The Robotic Nervous System | ROS 2, rclpy, URDF |
| Module 2 | The Digital Twin | Gazebo, Unity, Simulation |
| Module 3 | The AI-Robot Brain | NVIDIA Isaac Sim, Nav2, SLAM |
| Module 4 | Vision-Language-Action | OpenAI Whisper, GPT-4, LLMs |
| Module 5 | Capstone Project | Full autonomous system integration |
Why This Course is Different
1. Spec-Driven Development
Every concept is built on:
- ✅ Clear specifications: Know exactly what you're building
- ✅ Testable outcomes: Verify your robot works at every step
- ✅ Production-ready code: No toy examples, real-world patterns
2. Co-Learning with AI
You're not alone! This course embraces modern tools:
- Use ChatGPT and Claude to debug faster
- Leverage LLMs for code generation
- Learn to work with AI, not against it
3. Hands-On from Day 1
No endless theory. You'll:
- Write your first ROS 2 node in Chapter 1
- Control a simulated robot by Module 2
- Build a voice-controlled autonomous robot by the Capstone
Who This Course is For
Beginners: If you know Python and want to enter robotics, start here. We use biological analogies to make ROS 2 intuitive.
Software Engineers: If you build web apps or AI models and want to work with physical systems, this course bridges the gap.
Robotics Enthusiasts: If you've tinkered with Arduino or Raspberry Pi and want to scale up to autonomous systems, this is your roadmap.
Prerequisites
Required:
- Comfortable with Python (functions, classes, async)
- Basic command-line usage (cd, ls, pip install)
- Familiarity with Git (clone, commit)
Recommended but not required:
- Undergraduate-level physics (helpful for simulation tuning)
- Experience with Linux (Ubuntu 22.04 preferred)
System Requirements:
- Minimum: Ubuntu 22.04, 8 GB RAM, 50 GB disk
- Recommended: Ubuntu 22.04, 16 GB RAM, NVIDIA RTX GPU, 100 GB disk
- Alternative: Docker on Windows/macOS (with performance trade-offs)
Course Philosophy
The Cyber-Physical Mindset
Robotics isn't just software or hardware—it's both. You'll learn to:
- Think in coordinate frames (not just variables)
- Design for physics (not just algorithms)
- Test in simulation before deploying to hardware
From Zero to Autonomous
We follow a progression:
- Nodes (communication)
- Simulation (safe testing)
- Navigation (autonomous movement)
- Intelligence (voice + vision)
- Integration (full system)
Each module builds on the last, but is independently valuable.
How to Use This Textbook
🎯 Linear Path (Recommended for Beginners)
Work through modules 1-5 in order. Each chapter has:
- Learning objectives: What you'll master
- Code examples: Copy, paste, and run
- Mermaid diagrams: Visualize architectures
- Hands-on exercises: Test your understanding
🚀 Skip to What You Need (For Experienced Learners)
- Already know ROS 2? Jump to Module 2 (Simulation)
- Want AI control ASAP? Skim Modules 1-2, focus on Module 4 (VLA)
- Need Nav2 only? Read Module 3 (Isaac Sim & Nav2)
💡 Best Practices
- Run every code example: Don't just read—execute
- Complete exercises: Passive learning doesn't work in robotics
- Break things: Debugging teaches more than perfect code
- Use AI assistants: ChatGPT, Claude, and Copilot are your friends
Visual Learning
This course uses 30+ Mermaid.js diagrams to explain:
- System architectures
- Data flows
- State machines
- Dependency graphs
Every technical concept has a visual aid.
The Capstone Project
Your final project: Build a voice-controlled autonomous humanoid that:
- Listens to commands ("Go to the kitchen")
- Parses with an LLM (GPT-4 or Gemini)
- Navigates using Nav2
- Executes in Gazebo or Isaac Sim
This isn't optional—it's the culmination of everything you've learned.
Getting Help
- Documentation: Every chapter links to official ROS 2, Nav2, and Isaac Sim docs
- Community: Join the Physical AI Discord (coming soon)
- Issues: Report errors at GitHub Issues
Ready to Begin?
Let's start with the fundamentals. Click below to dive into Module 1: The Robotic Nervous System.
Good luck, and welcome to the future of Physical AI! 🤖⚡
— Sharmeen Fatima