Before the Last Token: Diagnosing Final-Token Safety Probe Failures | Hackobar