Behavioral safety evals miss latent vulnerabilities; Latent Vulnerability Score proposed | HACKOBAR_