[arXiv]score: 0.41
Efficient LLM Reasoning via Variational Posterior Guidance with Efficiency Awareness
May 13, 2026
Variational posterior guidance framework reduces LLM reasoning chain length by proving posterior distributions guided by reference answers achieve higher expected utility than prior policies.
cs.LGcs.AI