Observation Configuration#

This page is the complete reference for configuring observations in the deployment system. It covers the YAML configuration format, the encoder system, every available observation type, and how to create your own custom observations.

Configuration Format#

Observations are configured via a YAML file passed with --obs-config <path>. Each observation has a name (must match a registered observation) and an enabled flag.

Basic Structure#

observations:
  - name: "motion_joint_positions"
    enabled: true
  - name: "motion_joint_velocities"
    enabled: true
  - name: "motion_anchor_orientation"
    enabled: true
  - name: "base_angular_velocity"
    enabled: true
  - name: "body_joint_positions"
    enabled: true
  - name: "body_joint_velocities"
    enabled: true
  - name: "last_actions"
    enabled: true

Key rules:

  • Observations are concatenated in the order listed to form the policy input vector.

  • Offsets are calculated automatically — no manual offset management needed.

  • The total dimension of all enabled observations must match your ONNX model’s input size.

  • Disabled observations (enabled: false) are skipped entirely.

  • Reordering entries changes the layout of the input tensor (offsets shift accordingly).

With Encoder (Token-Based Policies)#

For policies that use an encoder to compress observations into a compact token, add an encoder: section:

observations:
  - name: "token_state"           # Encoder outputs (dimension set below)
    enabled: true
  - name: "base_angular_velocity" # Direct observations
    enabled: true
  - name: "body_joint_positions"
    enabled: true
  - name: "body_joint_velocities"
    enabled: true
  - name: "last_actions"
    enabled: true

encoder:
  dimension: 64       # Token output dimension
  use_fp16: false     # TensorRT precision for encoder (optional)
  encoder_observations:
    - name: "motion_joint_positions_10frame_step5"
      enabled: true
    - name: "motion_joint_velocities_10frame_step5"
      enabled: true
    - name: "motion_anchor_orientation_10frame_step5"
      enabled: true
    - name: "motion_root_z_position_10frame_step5"
      enabled: true
  encoder_modes:            # Optional: per-mode observation requirements
    - name: "g1"
      mode_id: 0
      required_observations:
        - motion_joint_positions_10frame_step5
        - motion_joint_velocities_10frame_step5
        - motion_anchor_orientation_10frame_step5
        - motion_root_z_position_10frame_step5

Encoder fields:

Field

Description

dimension

Token output dimension (must match encoder ONNX model output). Set to 0 or omit to disable encoder.

use_fp16

Use FP16 precision for encoder TensorRT engine (default: false).

encoder_observations

Observations fed to the encoder (superset of all modes). Same name/enabled format as policy observations.

encoder_modes

(Optional) Per-mode observation requirements. Observations not in a mode’s required_observations are zero-filled, saving computation.

Run with --encoder-file <path> to load the encoder model. If omitted, token_state can be set externally via ROS2/ZMQ.

See policy/observation_config_example.yaml for a complete annotated example.

Naming Convention#

Multi-frame observations follow the pattern: {base_name}_{N}frame_step{S}

  • N = number of frames gathered (temporal window size)

  • S = step size between frames (in control ticks at 50 Hz, so step5 = 0.1 s apart)

  • Without the suffix = single current frame only

For example, motion_joint_positions_10frame_step5 gathers 10 frames of joint positions, sampled every 5 ticks (0.1 s), giving a 0.9 s look-ahead window. If future frames exceed the motion length, the last frame is repeated.


Encoder & Token Observations#

These observations relate to the encoder (tokenizer) system. See With Encoder above for the YAML format.

Name

Dim

Description

token_state

config

Encoder output tokens (dimension set by encoder.dimension in YAML). Populated by local encoder inference or externally via ZMQ/ROS2.

encoder_mode

3

Current encoder mode ID + 2 zero-padding values.

encoder_mode_4

4

Current encoder mode ID + 3 zero-padding values.


Motion Reference Observations#

Gathered from the currently-active motion sequence (reference motions, planner output, or ZMQ stream). All joint data uses IsaacLab joint ordering (29 joints).

Joint Positions (from motion)#

Name

Dim

Frames

Step

Description

motion_joint_positions

29

1

Current frame joint positions (rad)

motion_joint_positions_3frame_step1

87

3

1

3-frame window, consecutive

motion_joint_positions_5frame_step5

145

5

5

5-frame window, 0.1 s apart

motion_joint_positions_10frame_step1

290

10

1

10-frame window, consecutive

motion_joint_positions_10frame_step5

290

10

5

10-frame window, 0.1 s apart

motion_joint_positions_lowerbody_10frame_step1

120

10

1

Lower-body joints only (12 joints), consecutive

motion_joint_positions_lowerbody_10frame_step5

120

10

5

Lower-body joints only, 0.1 s apart

motion_joint_positions_wrists_10frame_step1

60

10

1

Wrist joints only (6 joints), consecutive

motion_joint_positions_wrists_2frame_step1

12

2

1

Wrist joints only, 2 consecutive frames

Note

When upper-body control is active (e.g., via ZMQ/ROS2 teleoperation), the upper-body joint positions in these observations are replaced with the externally-provided targets.

Joint Velocities (from motion)#

Name

Dim

Frames

Step

Description

motion_joint_velocities

29

1

Current frame joint velocities (rad/s). Zero when not playing.

motion_joint_velocities_3frame_step1

87

3

1

3-frame window, consecutive

motion_joint_velocities_5frame_step5

145

5

5

5-frame window, 0.1 s apart

motion_joint_velocities_10frame_step1

290

10

1

10-frame window, consecutive

motion_joint_velocities_10frame_step5

290

10

5

10-frame window, 0.1 s apart

motion_joint_velocities_lowerbody_10frame_step1

120

10

1

Lower-body joints only, consecutive

motion_joint_velocities_lowerbody_10frame_step5

120

10

5

Lower-body joints only, 0.1 s apart

motion_joint_velocities_wrists_10frame_step1

60

10

1

Wrist joints only, consecutive

Anchor Orientation (from motion)#

Heading-corrected relative rotation from the robot’s current base orientation to the reference motion orientation. Output is the first two columns of the 3×3 rotation matrix (6 values per frame).

Name

Dim

Frames

Step

Description

motion_anchor_orientation

6

1

Current frame anchor orientation

motion_anchor_orientation_10frame_step1

60

10

1

10-frame window, consecutive

motion_anchor_orientation_10frame_step5

60

10

5

10-frame window, 0.1 s apart

Root Z Position (from motion)#

Name

Dim

Frames

Step

Description

motion_root_z_position

1

1

Current frame root height (m)

motion_root_z_position_3frame_step1

3

3

1

3-frame window, consecutive

motion_root_z_position_10frame_step1

10

10

1

10-frame window, consecutive

motion_root_z_position_10frame_step5

10

10

5

10-frame window, 0.1 s apart


SMPL Observations#

Gathered from SMPL data in the motion sequence (optional — requires motions with smpl_joint.csv / smpl_pose.csv).

SMPL Joint Positions#

3D positions per SMPL joint (24 joints × 3 = 72 per frame).

Name

Dim

Frames

Step

Description

smpl_joints

72

1

Current frame, all 24 SMPL joints

smpl_joints_2frame_step1

144

2

1

2 consecutive frames

smpl_joints_5frame_step5

360

5

5

5-frame window, 0.1 s apart

smpl_joints_10frame_step1

720

10

1

10-frame window, consecutive

smpl_joints_10frame_step5

720

10

5

10-frame window, 0.1 s apart

smpl_joints_lower_10frame_step1

270

10

1

Lower-body SMPL joints only (9 joints), consecutive

SMPL Poses (Axis-Angle)#

3D axis-angle per SMPL body part (21 poses × 3 = 63 per frame).

Name

Dim

Frames

Step

Description

smpl_pose

63

1

Current frame, all 21 SMPL poses

smpl_pose_5frame_step5

315

5

5

5-frame window, 0.1 s apart

smpl_pose_10frame_step1

630

10

1

10-frame window, consecutive

smpl_pose_10frame_step5

630

10

5

10-frame window, 0.1 s apart

smpl_elbow_wrist_poses_10frame_step1

120

10

1

Elbow + wrist poses only (4 parts), consecutive

SMPL Aliases#

These use the same gatherers as the motion observations but are intended for SMPL-based policies:

Name

Dim

Frames

Step

Description

smpl_root_z_10frame_step1

10

10

1

Root height, 10 consecutive frames

smpl_anchor_orientation_10frame_step1

60

10

1

Anchor orientation, 10 consecutive frames

smpl_anchor_orientation_2frame_step1

12

2

1

Anchor orientation, 2 consecutive frames


VR Tracking Observations#

VR 3-point and 5-point tracking data. When an external source (ZMQ/ROS2) provides VR data, buffered values are used directly. Otherwise, positions and orientations are computed from the motion sequence’s body data and normalised to the root body frame.

VR 3-Point#

Name

Dim

Description

vr_3point_local_target

9

3-point positions in root frame: [left_wrist xyz, right_wrist xyz, head xyz]

vr_3point_local_target_compliant

9

Same as above (identical during teleoperation)

vr_3point_local_orn_target

12

3-point orientations in root frame: [left quat wxyz, right quat wxyz, head quat wxyz]

vr_3point_compliance

3

Compliance values: [left_arm, right_arm, head]. Keyboard-controlled (g/h/b/v keys), range [0.0, 0.5].

VR 5-Point#

Name

Dim

Description

vr_5point_local_target

15

5-point positions in root frame: [left_wrist, right_wrist, head, left_ankle, right_ankle] × xyz

vr_5point_local_orn_target

20

5-point orientations in root frame: 5 quaternions × wxyz


Robot State History Observations#

Gathered from the StateLogger ring buffer (measured sensor data from the real robot). These provide temporal context by sampling past states.

Single-Frame (Current State)#

Name

Dim

Description

base_angular_velocity

3

IMU angular velocity (rad/s): [roll_rate, pitch_rate, yaw_rate]

body_joint_positions

29

Current joint positions from encoders (rad, IsaacLab order)

body_joint_velocities

29

Current joint velocities from encoders (rad/s, IsaacLab order)

last_actions

29

Previous policy output (normalised action values)

gravity_dir

3

Gravity direction in body frame (computed from base IMU quaternion)

Multi-Frame History (4 frames, step 1)#

Name

Dim

Description

his_body_joint_positions_4frame_step1

116

Joint positions: 4 consecutive ticks (29 × 4)

his_body_joint_velocities_4frame_step1

116

Joint velocities: 4 consecutive ticks

his_last_actions_4frame_step1

116

Past actions: 4 consecutive ticks

his_base_angular_velocity_4frame_step1

12

Angular velocity: 4 consecutive ticks (3 × 4)

his_gravity_dir_4frame_step1

12

Gravity direction: 4 consecutive ticks

Multi-Frame History (10 frames, step 1)#

Name

Dim

Description

his_body_joint_positions_10frame_step1

290

Joint positions: 10 consecutive ticks (29 × 10)

his_body_joint_velocities_10frame_step1

290

Joint velocities: 10 consecutive ticks

his_last_actions_10frame_step1

290

Past actions: 10 consecutive ticks

his_base_angular_velocity_10frame_step1

30

Angular velocity: 10 consecutive ticks (3 × 10)

his_gravity_dir_10frame_step1

30

Gravity direction: 10 consecutive ticks


Creating Custom Observations#

You can add your own observation types by modifying the C++ source. The observation system is built around a registry pattern — you write a gatherer function, register it with a name and dimension, and then use that name in your YAML config.

All observation code lives in gear_sonic_deploy/src/g1/g1_deploy_onnx_ref/src/g1_deploy_onnx_ref.cpp inside the G1Deploy class.

Step 1: Write a Gatherer Function#

A gatherer function reads from internal state (sensor data, motion data, etc.) and writes its output into a target buffer at a given offset. The signature is:

bool MyObservation(std::vector<double>& target_buffer, size_t offset) {
    // Write your observation values into target_buffer starting at offset.
    // Return true on success, false on failure (will stop the control loop).
}

Available data sources inside G1Deploy (see member variables in g1_deploy_onnx_ref.cpp for the full list):

Source

Description

state_logger_

Ring buffer of past robot states — IMU, joints, velocities, actions, hand states, token state

current_motion_ / current_frame_

Currently-active motion sequence and playback cursor

operator_state

Operator control flags (.play, .start, .stop)

vr_*_buffer_, left_hand_joint_buffer_, etc.

Buffered input interface data — VR tracking, hand joints, compliance, upper-body targets

heading_state_buffer_, movement_state_buffer_

Thread-safe buffers for heading and planner movement commands

Example — a custom observation that outputs the torso IMU angular velocity (3 values):

bool GatherTorsoAngularVelocity(std::vector<double>& target_buffer, size_t offset) {
    if (!state_logger_) { return false; }

    auto hist = state_logger_->GetLatest(1);
    if (hist.empty()) { return false; }

    const auto& entry = hist[0];
    target_buffer[offset + 0] = entry.body_torso_ang_vel[0];
    target_buffer[offset + 1] = entry.body_torso_ang_vel[1];
    target_buffer[offset + 2] = entry.body_torso_ang_vel[2];
    return true;
}

Step 2: Register in the Observation Registry#

Add your observation to the GetObservationRegistry() method in g1_deploy_onnx_ref.cpp. Each entry is a tuple of {name, dimension, gatherer_lambda}:

std::vector<ObservationRegistry> GetObservationRegistry() {
    return {
        // ... existing observations ...

        // Your custom observation:
        {"torso_angular_velocity", 3,
         [this](std::vector<double>& buf, size_t offset) {
             return GatherTorsoAngularVelocity(buf, offset);
         }},
    };
}

The name is the string you’ll use in the YAML config. The dimension must be exact — the system validates that the total of all enabled observations matches the ONNX model input size.

Step 3: Use in YAML Config#

Once registered, your observation is available like any built-in one:

observations:
  - name: "torso_angular_velocity"
    enabled: true
  # ... other observations ...

Tips#

  • Dimension must be fixed. The observation dimension is set at registration time and cannot change at runtime. If you need variable-size data, pad to a fixed maximum.

  • Don’t allocate in the hot path. Gatherer functions run at 50 Hz in the control loop. Avoid new, malloc, or resizing vectors. Pre-allocate buffers in the constructor or use stack arrays.

  • Return false carefully. Returning false from a gatherer stops the entire control loop. Only return false for unrecoverable errors. For missing optional data, write zeros and return true.

  • Thread safety. Gatherers run on the control thread. Reading from state_logger_ and DataBuffer objects is thread-safe. Accessing current_motion_ and current_frame_ is protected by current_motion_mutex_ (already held when GatherObservations() is called).

  • Multi-frame pattern. If your observation needs temporal windows, follow the existing GatherHis* or GatherMotion*MultiFrame patterns — they accept num_frames and step_size parameters and register multiple variants (e.g., my_obs, my_obs_4frame_step1, my_obs_10frame_step5).

  • Encoder observations. Custom observations can also be used as encoder inputs. Register them in the same registry — they’ll be available for both observations: and encoder_observations: in the YAML config.

  • Rebuild after changes. After modifying the C++ source, rebuild with just build from the gear_sonic_deploy/ directory.


Example Configurations#

Minimal (154D — default policy)#

observations:
  - name: "motion_joint_positions"       # 29D
    enabled: true
  - name: "motion_joint_velocities"      # 29D
    enabled: true
  - name: "motion_anchor_orientation"    # 6D
    enabled: true
  - name: "base_angular_velocity"        # 3D
    enabled: true
  - name: "body_joint_positions"         # 29D
    enabled: true
  - name: "body_joint_velocities"        # 29D
    enabled: true
  - name: "last_actions"                 # 29D
    enabled: true
# Total: 154D

Token-Based Policy with Encoder#

observations:
  - name: "token_state"                  # 64D (from encoder)
    enabled: true
  - name: "base_angular_velocity"        # 3D
    enabled: true
  - name: "body_joint_positions"         # 29D
    enabled: true
  - name: "body_joint_velocities"        # 29D
    enabled: true
  - name: "last_actions"                 # 29D
    enabled: true

encoder:
  dimension: 64
  use_fp16: false
  encoder_observations:
    - name: "motion_joint_positions_10frame_step5"   # 290D
      enabled: true
    - name: "motion_joint_velocities_10frame_step5"  # 290D
      enabled: true
    - name: "motion_anchor_orientation_10frame_step5" # 60D
      enabled: true
    - name: "motion_root_z_position_10frame_step5"   # 10D
      enabled: true

VR Teleoperation Policy#

observations:
  - name: "token_state"                         # 64D
    enabled: true
  - name: "vr_3point_local_target"              # 9D
    enabled: true
  - name: "vr_3point_local_orn_target"          # 12D
    enabled: true
  - name: "vr_3point_compliance"                # 3D
    enabled: true
  - name: "base_angular_velocity"               # 3D
    enabled: true
  - name: "body_joint_positions"                # 29D
    enabled: true
  - name: "body_joint_velocities"               # 29D
    enabled: true
  - name: "last_actions"                        # 29D
    enabled: true

See Configuration Format above for YAML syntax details.