Worst-Dimension Optimization Targets Weakest Reasoning Step in Multimodal PRMs | HACKOBAR_