Staged VLM Training Shows Perception Bottlenecks Reasoning; RL Beats SFT for Visual Tasks | HACKOBAR_