[HUGGINGFACE]score: 0.80
Be Kind, Rewrite: Benign Projections via Rewriting Defend Against LLM Data Poisoning Attacks
May 17, 2026
Open-book benign rewriting (OBBR) defense against LLM backdoor attacks theoretically guarantees higher benign output probability than closed-book rewriting by leveraging reference samples during inference.
paper