Humanoid robot market by 2035
Goldman Sachs
LLMs had the whole internet to read. Robots have no internet of actions - the two largest open robot datasets combined add up to roughly 5,000 hours. Physical AI is bottlenecked on one thing: first-person video of real humans doing real, skilled work.
Nxted Capture records that data from India’s skilled workforce - at a fraction of Western cost, with the diversity the scaling laws reward.
Real hands. Real tools. Real workshops.
Egocentric data for robotics is first-person video recorded from the worker’s own point of view as they perform a task - what a robot’s head- or wrist-mounted camera would see - plus depth, hand pose and 6-DoF motion. It teaches manipulation policies how a skilled human actually moves, which third-person or web video cannot.
India has one of the world’s largest skilled workforces across the trades and professions the scaling laws reward. nxted captures only verified, consented contributors in real workplaces, with fair pay and documented provenance. Skill depth and provenance - not cost - are the point, though Indian capture is also materially lower-cost.
Humanoid robot market by 2035
Goldman Sachs
Spent yearly by robotics firms buying real-world data
MIT Technology Review
The entire open-source robot dataset supply, combined
Scale AI
Policy gain from human egocentric data vs robot data alone
Meta · EgoMimic
Language models trained on trillions of web tokens. There is no equivalent corpus of physical manipulation - so robots have to be shown, demonstration by demonstration.
Peer-reviewed scaling laws (ICLR 2025) show robot-policy generalisation follows a power law in the number of environments and objects - exactly what a diverse, India-wide workforce provides.
Over $6B was invested in humanoid robotics in 2025. Robotics firms already spend $100M+ a year buying real-world data. The supply hasn’t caught up. That’s the gap.
We recruit and verify tradespeople across India - tailors, machinists, carpenters, chefs, medical staff - matched to the skill and level you need.
Workers wear egocentric (first-person) glasses built on research-standard devices, plus depth and hand-pose sensors for robotics-ready tiers.
We film the actual job in the actual workshop - not a sterile lab. Diversity of scene and object is what the scaling laws reward.
Action segmentation, first-person narration, hand-pose tracks, 6DoF trajectory, and skill-level metadata - the Ego4D / Ego-Exo4D methodology.
Every batch is reviewed by our expert team. Signed releases, PII blurring, and a GDPR-compliant DPA cover every frame before delivery.
LeRobot, RLDS, HDF5, or MP4 + sidecars - dropped straight into your imitation-learning or VLA training stack.
We don’t ship fictional proprietary hardware. Our rigs are built on Meta Project Aria, Intel RealSense, Stereolabs ZED, and the Universal Manipulation Interface - the same stack behind Ego-Exo4D and EgoMimic. Three tiers, matched to how training-ready you need the data to be.
High-volume, low-cost
Pre-training video corpora; scaling across many workers fast
Built on UMI-style GoPro capture
Research-grade egocentric
The human→humanoid recipe - same glasses on worker and robot
Built on Meta Project Aria
Direct policy training
Imitation learning & VLA training; Open X-Embodiment-ready
Built on Aria + RealSense + DexCap / UMI
Sorting, stacking, packing, basic assembly, labeling
Pack 50 items per minute across 5 product types
Tailoring, carpentry, cooking, cleaning, gardening
Complete garment assembly with hand and machine techniques
CNC operation, welding, electrical work, plumbing, electronics assembly
Set up and operate a CNC lathe for precision part manufacture
Surgical assistance, patient care, pharmacy, dental, lab work
Prep and assist in laparoscopic procedure, instrument handling
Heritage crafts, traditional medicine, precision jewellery, instrument making
Traditional Kanjivaram silk weaving - 40-step process
Every dataset ships in a consistent, robotics-ready structure - raw and processed video, multi-format episodes, full metadata, and the compliance pack. Below is the shape of a typical industrial delivery.
nxted_industrial_india_01/ dataset_card.md # scope, splits, limitations consent_manifest.csv # per-contributor consent + pay task_cards/ # what each episode demonstrates raw_video/ # original egocentric capture processed_video/ # redacted, stabilised, trimmed exo_video/ # third-person reference angle metadata/ # calibration, 6DoF poses, timestamps annotations/ # action labels, segments, success quality_report.pdf # inter-annotator agreement + QA lerobot/ # LeRobot-format episodes rlds/ # RLDS / TFDS shards hdf5/ # HDF5 trajectories
The consent, provenance, redaction and QA artifacts are bundled as the Data Trust Pack - the documentation your data, legal and safety teams need to sign off a dataset for production.
We don’t ask you to take our word for it. Here is what the leading labs and companies have published - every claim links to its source.
Our flagship vertical is skilled industrial and technical work - electrical panel assembly, machine tending, CNC setup, electronics assembly, and inspection - the data the free open datasets don't cover and gig networks can't reach.
Panel wiring · Electronics assembly · Machine tending · Inspection
Lathe ops · Mill setup · Quality inspection
Hand stitching · Machine sewing · Pattern cutting
Bricklaying · Tile setting · Pick and pack
Joinery · Lathe work · Lacquer finishing
Knife skills · Plating · Tandoor / griddle
Suturing · Instrument handoff · Patient prep
Real tradespeople doing real, skilled work - electrical assembly, machine tending, CNC setup, tailoring, surgical-adjacent tasks. The breadth of credentialed trades the scaling laws reward, and that Western lab-bound datasets lack.
Workers paid above local market rate, fully consented, with signed releases and PII/face/plate/screen redaction. Every frame is covered by a DPDP and GDPR-aligned DPA. Consent-first capture is the product, not an afterthought.
Every dataset ships with a Data Trust Pack: consent records, skill verification, reviewer credentials, a dataset card, and a QA report. You always know who produced the data and how.
Indian capture is also materially lower-cost, but cost is the footnote, not the pitch.
Market forecasts are quoted as attributed estimates and diverge across analysts. Cost differentials are industry estimates presented as ranges. Research citations link to primary sources (company sites, arXiv, dataset homepages).
Custom datasets quoted within 24 hours, in any format your pipeline already uses.
Egocentric data is first-person video recorded from the doer’s point of view - what a robot’s own camera would see - usually with depth, hand pose and 6-DoF motion. Robots need it because there is no web-scale corpus of physical actions; manipulation policies have to be shown how skilled humans move, demonstration by demonstration.
nxted builds on the rigs the research field already uses: Meta Project Aria glasses, Intel RealSense and Stereolabs ZED depth cameras, and Universal Manipulation Interface grippers - the same class of stack behind datasets like Ego-Exo4D. Three tiers map to how training-ready you need the data, from RGB-only up to full depth and hand pose.
The flagship vertical is skilled industrial and technical work - electrical panel assembly, machine tending, CNC setup, electronics assembly and inspection. nxted also captures tailoring and textile, construction and warehouse, carpentry, cooking, and medical-adjacent tasks, using credentialed contributors for the specialist categories.
Each dataset ships in LeRobot, RLDS and HDF5, plus raw and processed egocentric video, an optional third-person reference angle, full metadata (camera calibration, 6-DoF trajectories, hand pose, timestamps), action segmentation and success/failure labels, a dataset card, a consent manifest and a QA report.
Yes. Contributors are paid above the local market rate, give explicit, withdrawable consent, and every filming site signs a release. Faces, plates, screens and PII are redacted, no contributor is under 18, and each delivery includes a Data Trust Pack with a DPDP & GDPR-aligned DPA.
Start with a Physical AI Test Kit from $2,500 - 5 to 10 usable hours of one skilled task with a consent pack, metadata, basic labels and a LeRobot/RLDS/HDF5 sample, delivered in 7-10 days. Full datasets are priced per usable hour by skill level and quoted within 24 hours.