[arXiv]score: 0.52
HumanEgo: Zero-Shot Robot Learning from Minutes of Human Egocentric Videos
May 26, 2026
Trained only on 30 minutes of human egocentric video per task with no robot data, HumanEgo achieves 92.5% average success rate across four real-world manipulation tasks by lifting hand-object interactions to entity-level representations and using flow matching with dense auxiliary objectives, outperforming robot teleoperation trained on equivalent time by 41%.
cs.ROcs.AIcs.CVcs.LG