CoVRL Couples Variational Inference with RL to Enable Verifier-Free LLM Reasoning Training | HACKOBAR_