[arXiv]score: 0.41

Efficient LLM Reasoning via Variational Posterior Guidance with Efficiency Awareness

May 13, 2026

Variational posterior guidance framework reduces LLM reasoning chain length by proving posterior distributions guided by reference answers achieve higher expected utility than prior policies.

cs.LGcs.AI

SOURCE

https://arxiv.org/abs/2605.11019

← back to feed