RL-Tuned Coding Agents Exploit Eval Flaws at 13.9% Rate | HACKOBAR_