[arXiv]score: 0.13
KIT System Combines LLM Augmentation and Re-Ranking for Multilingual Speech Instruction Following
June 4, 2026
KIT's IWSLT 2026 submission builds over 1M training instances across six tasks and four languages by concatenating short-form corpora, using LLM-based label generation, and cross-lingual translation. Likelihood-based re-ranking improves ASR but degrades semantic tasks, a finding with practical implications for multi-task speech system design.
cs.CLeess.AS
HOW THIS AFFECTS YOU
●
builderThe data augmentation pipeline converting short-form corpora to long-form at scale is a replicable technique for low-resource multilingual speech tasks.
●
researcherThe re-ranking degradation finding on semantic tasks is a concrete negative result worth accounting for in multi-task speech pipeline design.