[arXiv]score: 0.41
Distribution Corrected Offline Data Distillation for Large Language Models
May 15, 2026
Addresses distributional drift in offline LLM distillation by correcting the mismatch between teacher-conditioned training and student self-generated inference to reduce compounding errors in long reasoning traces.
cs.CL