E3RL Uses Intrinsic Entropy Signals to Prevent Cascading Reasoning Failures in LLMs | HACKOBAR_