VLM Judges Unreliable for Visually Impaired Assistance Tasks — GPT-5.4 Hits Only 52.6% Diagnostic Accuracy | HACKOBAR_