WARP Framework Recovers Training Data Portfolios from Model Weights
July 1, 2026
WARP utilizes model merging to interpolate between base and fine-tuned weights, creating pseudo-checkpoints to reconstruct training data mixtures. Unlike membership inference, this framework allows for the recovery of global training distribution characteristics and domain mixture weights from released model weights.
HOW THIS AFFECTS YOU
●
researcherYou can estimate the data composition of proprietary models using only their weights.
●
investorThis increases transparency regarding how competitors are sourcing and weighting their training data.