Self-Distillation Plus RL Elicits Task-Solving in Video Diffusion Models | HACKOBAR_