[HN]score: 0.05
Starchild-1: The First Real-Time Multimodal World Model
May 18, 2026
Starchild-1 is a real-time multimodal world model that jointly simulates video and audio from large-scale video data, bypassing text-based training. It targets interactive simulation applications where synchronized audiovisual generation at inference speed matters. Competes with video-only world models like Genie and GameNGen by adding real-time audio synthesis.