[arXiv] score: 0.35
Bad Seeing or Bad Thinking? Rewarding Perception for Vision-Language Reasoning
May 15, 2026
Proposes rewarding perception in vision-language models to achieve robust perception-reasoning synergy without architectural redesign or agentic complexity, addressing the seesaw effect in which gains in perception or reasoning come at the expense of the other.
cs.AI, cs.CV