[arXiv]score: 0.13

KIT System Combines LLM Augmentation and Re-Ranking for Multilingual Speech Instruction Following

June 4, 2026

KIT's IWSLT 2026 submission builds over 1M training instances across six tasks and four languages by concatenating short-form corpora, using LLM-based label generation, and cross-lingual translation. Likelihood-based re-ranking improves ASR but degrades semantic tasks, a finding with practical implications for multi-task speech system design.

cs.CLeess.AS

HOW THIS AFFECTS YOU

●

builderThe data augmentation pipeline converting short-form corpora to long-form at scale is a replicable technique for low-resource multilingual speech tasks.

●

researcherThe re-ranking degradation finding on semantic tasks is a concrete negative result worth accounting for in multi-task speech pipeline design.

SOURCE

https://arxiv.org/abs/2606.04730

← back to feed