[arXiv] score: 0.41

Efficient LLM Reasoning via Variational Posterior Guidance with Efficiency Awareness

May 13, 2026
A variational posterior guidance framework that shortens LLM reasoning chains, supported by a proof that posterior distributions guided by reference answers achieve higher expected utility than prior policies.
cs.LG, cs.AI