Multi-Agent Debate Hurts Data Generation but Boosts Error Detection by 27pp F1 | HACKOBAR_