[arXiv]score: 0.41

Distribution Corrected Offline Data Distillation for Large Language Models

May 15, 2026

Addresses distributional drift in offline LLM distillation by correcting the mismatch between teacher-conditioned training and student self-generated inference to reduce compounding errors in long reasoning traces.

cs.CL

SOURCE

https://arxiv.org/abs/2605.14071

← back to feed