COMPASS Handbook#
↗ Project page · arXiv · Source on GitHub
End-to-end pipeline#
Each stage has a top-level entry-point script in the repo:
# |
Stage |
Script |
|---|---|---|
1 |
Train residual RL specialist (or evaluate any policy) |
|
2 |
Roll out specialists to collect HDF5 distillation data |
|
3 |
Distil specialists into one generalist |
|
4 |
Export to ONNX / JIT |
|
5 |
Convert ONNX → TensorRT engine |
|
6 |
Deploy via ROS2 in Isaac Sim or on real robots |
Built on Isaac Lab 3.0 / Isaac Sim 4.5 and on top of NVIDIA’s X-Mobility base policy.
Where to start#
Three commands to a training shell.
Docker-as-venv: three commands and you’re in a training shell.
How the residual RL loop works, how to override embodiment / scene.
ROS2 nodes consuming a TensorRT engine — Isaac Sim, sim2real, or object navigation.
Submit training and evaluation runs to NVIDIA’s OSMO cluster.
Use COMPASS distillation datasets to fine-tune NVIDIA’s Isaac-GR00T VLA model.
What’s where in the repo#
COMPASS/
├── compass/
│ ├── residual_rl/ # Actor-critic that adds a residual on top of X-Mobility
│ ├── distillation/ # Generalist distillation (PyTorch Lightning trainer)
│ ├── rl_env/ # Isaac Lab `mobility_es` extension (embodiments + scenes)
│ └── utils/ # Shared helpers
├── configs/ # Gin configs (train / eval / record / distillation / shared)
├── docker/ # Docker-as-venv dev environment + Dockerfiles
├── osmo/ # OSMO workflow YAMLs + Python launcher
├── ros2_deployment/ # ROS2 packages for Isaac Sim / real-robot deployment
├── scripts/ # Standalone tools (HDF5→LeRobot, omap generator, …)
└── x_mobility/ # Vendored X-Mobility wheel
For deeper layout details, see CLAUDE.md.