[HUGGINGFACE]score: 0.48

Local Perturbation Theory Explains Cross-Domain Interference in Multi-Domain RL Fine-Tuning

May 31, 2026

Single-domain RL post-training produces sparse, small-magnitude parameter edits that share active computation routes across domains, causing interference even when full-model gradients are nearly orthogonal — contradicting catastrophic forgetting and global gradient conflict explanations. The paper proves this under a local perturbation model and derives conditions for synergistic versus conflicting updates.

paper

HOW THIS AFFECTS YOU

●

researcherThe shared active computation route finding reframes multi-domain RL interference as a local, neuron-level phenomenon, providing a theoretically grounded target for designing multi-domain RL training schedules.

SOURCE

https://huggingface.co/papers/2606.02398

← back to feed