The Evaluation Trap: Benchmark Design as Theoretical Commitment | Hackobar