[arXiv] score: 0.41

Scaling Laws for Mixture Pretraining Under Data Constraints

May 14, 2026

Studies scaling laws for mixture pretraining when target data is limited, quantifying the trade-off between underexposure and overfitting when combining scarce domain-specific data with abundant generic data.

cs.LG, cs.CL

SOURCE

https://arxiv.org/abs/2605.12715
