Self-CTRL RL Method Improves LLM Self-Explanation Accuracy from R²=0.24 to 0.64 | HACKOBAR_