[arXiv] score: 0.41
Do Androids Dream of Breaking the Game? Systematically Auditing AI Agent Benchmarks with BenchJack
May 14, 2026
Introduces BenchJack, an automated tool for auditing agent benchmarks against reward hacking, derived from a taxonomy of eight recurring flaw patterns in frontier model evaluations.
cs.AI cs.CR