[arXiv]score: 0.35

Bad Seeing or Bad Thinking? Rewarding Perception for Vision-Language Reasoning

May 15, 2026

Proposes a method to reward perception in Vision-Language Models to achieve robust perception-reasoning synergy without architectural redesign or agentic complexity, addressing the seesaw effect between perception and reasoning performance.

cs.AIcs.CV

SOURCE

https://arxiv.org/abs/2605.14054

← back to feed