Training Data#

BONES-SEED#

BONES-SEED (Skeletal Everyday Embodiment Dataset) is an open dataset of 142,220 annotated human motion animations for humanoid robotics, created by Bones Studio. It provides motion capture data in SOMA and Unitree G1 formats with natural language descriptions, temporal segmentation labels, and detailed skeletal metadata.

Total motions

142,220 (71,132 original + 71,088 mirrored)

Total duration

~288 hours (@ 120 fps)

Performers

522 actors (253 F / 269 M)

Age range

17–71 years

Height range

145–199 cm

Weight range

38–145 kg

Output formats

SOMA Uniform · SOMA Proportional · Unitree G1 MuJoCo-compatible

Annotations

Up to 6 NL descriptions per motion + temporal segmentation + skeletal metadata

Relevance to SONIC#

BONES-SEED a large subset of SONIC training data:

  • Unitree G1 joint trajectories — retargeted for MuJoCo, directly usable for motion tracking training

  • Broad motion coverage — locomotion, manipulation, dance, sports, communication, and everyday activities across 8 categories and 20 sub-categories

  • Rich language annotations — up to 6 natural language descriptions per motion, enabling language-conditioned policy learning

  • Temporal segmentation — per-motion phase labels with timestamps for structured skill decomposition

  • Performer diversity — 522 actors spanning a wide range of body types, ages, and movement styles

Motion Categories#

Package

Motions

Description

Locomotion

74,488

Walking, jogging, jumping, climbing, crawling, turning, and transitions

Communication

21,493

Gestures, pointing, looking, and communicative body language

Interactions

14,643

Object manipulation, pick-and-place, carrying, and tool use

Dances

11,006

Full-body dance performances across multiple styles

Gaming

8,700

Game-inspired actions and dynamic movements

Everyday

5,816

Household tasks, consuming, sitting, reading, and daily activities

Sport

3,993

Athletic movements and sports-specific actions

Other

2,081

Stunts, martial arts, and edge-case motions

Data Formats#

Every motion is available in three formats:

  • SOMA Proportional (BVH) — per-actor skeleton preserving original body proportions

  • SOMA Uniform (BVH) — standardized skeleton shared across all motions for batch processing

  • Unitree G1 (CSV) — joint-angle trajectories retargeted to the Unitree G1 humanoid

Download#

# Using the Hugging Face CLI
pip install huggingface_hub
huggingface-cli download bones-studio/seed --repo-type dataset --local-dir ./bones-seed
# Using Python
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="bones-studio/seed",
    repo_type="dataset",
    local_dir="./bones-seed"
)

After downloading, extract the motion archives: