[HUGGINGFACE]score: 0.85

Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation

May 18, 2026

Mega-ASR addresses acoustic robustness in ASR by combining a 2M-sample compound dataset (Voices-in-the-Wild-2M) spanning 54 distortion scenarios with progressive acoustic-to-semantic training. Targets hallucination and omission failures under severe real-world noise. Relevant to production ASR teams where environmental robustness is a known gap versus clean-speech benchmarks.

paper

SOURCE

https://huggingface.co/papers/2605.19833

← back to feed