[arXiv]score: 0.41
Scaling Laws for Mixture Pretraining Under Data Constraints
May 14, 2026
Studies scaling laws for mixture pretraining when target data is limited, quantifying the trade-off between underexposure and overfitting when combining scarce domain-specific data with abundant generic data.
cs.LGcs.CL