Single Transformer Layer Matches Full-Parameter RL Training Performance | HACKOBAR_