[arXiv]score: 0.15

Token-Level Logit Gating Outperforms LLM Routing and Merging Baselines

June 4, 2026

DLLG trains a lightweight gating module to fuse token-level logits from multiple specialized LLMs using only response-level supervision, avoiding expert retraining or token-level labels. It consistently beats routing, heuristic ensembling, and parameter merging across reasoning and code benchmarks at multiple model scales.

cs.CL

HOW THIS AFFECTS YOU

●

builderYou can ensemble specialized models without retraining or parameter merging using a lightweight gating module, potentially improving task-specific accuracy with lower integration cost.

●

researcherSparse response-level supervision for token-level fusion is a practically useful training signal that sidesteps the label bottleneck in mixture-of-experts ensembling research.

SOURCE

https://arxiv.org/abs/2606.04378

← back to feed