Learning with Rare Success but Rich Feedback via Reflection-Enhanced Self-Distillation | Hackobar