[arXiv]score: 0.15
Token-Level Logit Gating Outperforms LLM Routing and Merging Baselines
June 4, 2026
DLLG trains a lightweight gating module to fuse token-level logits from multiple specialized LLMs using only response-level supervision, avoiding expert retraining or token-level labels. It consistently beats routing, heuristic ensembling, and parameter merging across reasoning and code benchmarks at multiple model scales.
cs.CL
HOW THIS AFFECTS YOU
●
builderYou can ensemble specialized models without retraining or parameter merging using a lightweight gating module, potentially improving task-specific accuracy with lower integration cost.
●
researcherSparse response-level supervision for token-level fusion is a practically useful training signal that sidesteps the label bottleneck in mixture-of-experts ensembling research.