
Real World. Real Intent. Raw Data.
High-quality, intent-rich POV datasets to fuel the next generation of Embodied AI.
The Problem
Training robots for the real world requires real-world data. Capturing authentic human behavior at scale has been the missing piece.
Controlled environments miss real-world messiness—clutter, varied lighting, unexpected obstacles.
Manual data collection is slow and expensive. Hours of operator time for minutes of useful data.
Simulated environments fail to capture household physics and the unpredictability of real homes.
One-time data releases can't keep up. Robots need continuous, fresh data for ongoing learning.
Our Solution
Phone-first egocentric capture from real homes—focused on goal-driven, intent-rich tasks.
First-person perspective mirrors how robots will perceive the world, unlike third-person observation.
Real-world messiness: clutter, varied lighting, unexpected layouts. Non-staged, diverse homes.
Long-horizon, goal-driven tasks. What to achieve matters more than exact trajectories.
Phone-first capture scales effortlessly. Lightweight mounts, no expensive hardware.
First-person POV
Verb + Noun labels
Goal-level task packs segmented into atomic actions with precise timestamps.
Quality at every step while scaling collection across our global contributor network.
One-time setup test ensures proper mount and framing.
Contributors record household tasks from egocentric view.
Secure upload via presigned URLs: files go directly to storage without passing through our servers.
Automated + manual review for technical and task quality.
Verb+Noun segmentation with precise timestamps.
Versioned dataset packs for your training pipeline.
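As a rough illustration of the presigned-URL upload step above, here is a minimal sketch of how a client could prepare a direct upload. The function name, the example URL, and the content type are assumptions made for illustration, not the platform's actual API. The sketch only builds the request, so it runs without network access.

```python
import urllib.request


def build_presigned_put(url: str, payload: bytes,
                        content_type: str = "video/mp4") -> urllib.request.Request:
    """Prepare an HTTP PUT that sends `payload` straight to a presigned URL."""
    return urllib.request.Request(
        url,
        data=payload,
        method="PUT",
        headers={"Content-Type": content_type},
    )


# Hypothetical presigned URL; a real one would be issued per upload and expire.
request = build_presigned_put(
    "https://storage.example.com/uploads/clip-0001.mp4?signature=PLACEHOLDER",
    b"\x00\x01\x02",  # stand-in bytes for a recorded clip
)

# To actually send the upload: urllib.request.urlopen(request)
```

Because the signature travels in the URL itself, the client needs no long-lived storage credentials in this kind of flow.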
Every video segmented into atomic actions following EPIC-KITCHENS vocabulary. Precise timestamps, machine-readable labels.
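To make the label format concrete, here is a minimal sketch of how one timestamped Verb+Noun segment might be represented in code. The class name, field names, and timestamp values are illustrative assumptions; the published schema docs define the real field specs.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ActionSegment:
    """One atomic action inside a longer goal-level task (hypothetical layout)."""
    verb: str
    noun: str
    start_s: float  # segment start, in seconds from the beginning of the video
    end_s: float    # segment end, in seconds

    @property
    def label(self) -> str:
        # Machine-readable "verb+noun" label, e.g. "open+door"
        return f"{self.verb}+{self.noun}"

    @property
    def duration_s(self) -> float:
        return self.end_s - self.start_s


def parse_label(label: str) -> tuple[str, str]:
    """Split a "verb+noun" label at the first "+" into (verb, noun)."""
    verb, noun = label.split("+", 1)
    return verb, noun


# Illustrative timestamps only, not taken from a real recording.
segments = [
    ActionSegment("open", "door", 0.0, 2.5),
    ActionSegment("pick_up", "glass", 2.5, 5.0),
    ActionSegment("place", "shelf", 5.0, 8.0),
]
```

Keeping verb and noun as separate fields makes it straightforward to filter by action type or by object while still emitting the combined label.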
open+door, pick_up+glass, place+shelf
What We Collect
15-20 core household tasks focused on activities robots need to master for real-world deployment.
Built on the EPIC-KITCHENS Verb-Noun schema, extended for whole-home manipulation tasks.
Key Features
Verb Categories
Mapping-ready for RT-X, Ego4D, and VLA pipelines
Everything to integrate into your pipeline. Originals + standardized encodes included.
Original source recordings preserved in full quality. Standardized encodes available for training compatibility: 1080p, 30fps, H.264 codec. Consistent aspect ratios across all videos.
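For readers who want to reproduce a comparable encode from an original recording, the sketch below builds an ffmpeg command for the stated target (1080p, 30fps, H.264). Using ffmpeg at all, and the file names shown, are assumptions; the text does not say which tool produces the standardized encodes.

```python
def standardized_encode_command(src: str, dst: str) -> list[str]:
    """Build an ffmpeg argument list for a 1080p, 30 fps, H.264 encode."""
    return [
        "ffmpeg",
        "-i", src,
        # Scale to 1080 pixels high; -2 keeps the source aspect ratio
        # and rounds the width to an even number, as H.264 requires.
        "-vf", "scale=-2:1080",
        "-r", "30",         # output frame rate
        "-c:v", "libx264",  # H.264 video encoder
        dst,
    ]


# Hypothetical file names for illustration.
command = standardized_encode_command("original.mov", "standardized_1080p30.mp4")

# To run it (requires ffmpeg on PATH):
#   import subprocess
#   subprocess.run(command, check=True)
```

Note that this scale filter preserves each source's own aspect ratio; enforcing one ratio across all videos would need an additional crop or pad step.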
Different home styles, clutter levels, and cultural patterns.
Robust training data for models that generalize.
Active Regions
50+
Q2 2025
Why Diversity Matters
Different floor plans, room sizes, and furniture arrangements.
Natural and artificial light across times of day and seasons.
Non-staged homes with authentic object placement and density.
Schema docs, type definitions, and field specs
Preview datasets with full annotations
PyTorch loaders, training scripts, examples
Methodology papers and benchmark results
Research datasets or custom collection solutions—we'd love to hear from you.
What happens next
Earn money recording everyday tasks from home