[arXiv] score: 0.45
Correct Answers from Sound Reasoning: Verifiable Process Supervision for Language Models
May 14, 2026
The verifiable process supervision (VPS) framework jointly optimizes prediction accuracy and reasoning quality in language models, addressing failure modes in which task accuracy improves while the underlying reasoning becomes less accurate or internally inconsistent.
cs.CL cs.AI