[arXiv]score: 0.15
SALSA Steering Vectors Cut ASR Error Up to 46.8% on Out-of-Domain Speech
June 2, 2026
SALSA learns layer-wise steering vectors via a supervised objective — rather than contrastive activation differences — to adapt speech-aware LLMs to children's speech, multilingual, and Mandarin-English code-switching benchmarks. It achieves up to 46.8% relative improvement over zero-shot baselines, with encoder later-layer steering outperforming LLM backbone steering.
cs.CLeess.AS
HOW THIS AFFECTS YOU
●
builderIf you're deploying ASR for non-standard speech populations, SALSA's lightweight adaptation approach could improve robustness without full fine-tuning.
●
researcherDirectly optimizing steering vectors with a supervised loss rather than contrastive pairs is a methodological distinction worth evaluating for other domain-shift adaptation tasks.