HACKOBAR_item
[arXiv]score: 0.38

LEAP: Unlocking dLLM Parallelism via Lookahead Early-Convergence Token Detection

May 13, 2026
LEAP accelerates diffusion LLM inference by detecting early-converging tokens via lookahead statistical analysis, relaxing overly strict confidence thresholds that bottleneck parallelism. The method unlocks greater token-level parallelism without sacrificing accuracy, addressing a core scalability limitation of dLLMs. Critical for teams deploying or benchmarking diffusion-based language models at scale.
cs.LGcs.AI