[arXiv]score: 0.10
Orthogonal Weight Edits Erase Diffusion Concepts Without Degrading Output Quality
May 29, 2026
Concept erasure in diffusion models fails when additive parameter updates entangle neuron direction, magnitude, and angular geometry simultaneously. This paper proposes orthogonal updates that decouple concept semantics (neuron direction) from generative capacity (angular geometry), achieving precise erasure without the collateral quality loss seen in methods like ROME-style edits.
cs.AI
HOW THIS AFFECTS YOU
●
builderIf the method holds up, it offers a deployment-friendly alternative to fine-tuning for content safety filtering in image generation pipelines.
●
researcherThe direction/magnitude/geometry decomposition offers a cleaner theoretical framing for weight-space editing that could generalize beyond diffusion models.