What we deliver
Research-ready dataset packs with raw footage, processed copies, manifests, and schema versioning.
Raw Capture Pack
Unprocessed egocentric footage
Direct phone recordings from our contributor network. Includes original video files, device metadata, and capture timestamps. Full quality preserved.
What's included
Standardized Dataset
Training-ready data packs
Quality-validated footage with standardized encodes (1080p, 30fps, H.264). Includes original files plus processed versions ready for integration.
What's included
Annotated Dataset
Temporal action labels
Verb+Noun action segments with timestamped boundaries. EPIC-KITCHENS compatible taxonomy. Currently in pilot with select partners.
What's included
Technical Notes
Data specifications and compatibility
Capture Protocol
- • First-person (egocentric) perspective
- • 1080p recommended, original quality preserved
- • 5-second scene scan before each task
- • Landscape orientation standard
Annotation Schema
Pilot- • Verb-Noun action segments
- • EPIC-KITCHENS compatible taxonomy
- • Timestamp-based (seconds, not frames)
- • 70+ verbs, 175+ nouns
Compatibility
- • RGB video for VLA/behavior cloning
- • Scene scan enables optional SfM by customer
- • Raw files preserved for flexibility
- • JSONL export format
Data Processing Pipeline
From raw capture to annotation-ready standardized files
Raw Capture
Original 4K/1080p from contributor devices
QC & Validation
Frame check, audio sync, content review
Standardization
720p H.264 30fps CFR encode
Delivery
Raw + Proxy + Manifests package
Standardization Specs
Why standardize? Raw mobile videos have variable frame rates (VFR), different codecs (HEVC/H.264), and varying resolutions. Our pipeline normalizes everything to a consistent format for reliable annotation timestamps and seamless integration with your training pipelines.