Mid-Training with Self-Generated Data Improves Reinforcement Learning in Language Models | HACKOBAR_