LLMs Show No Detectable Self-Preference When Revising Their Own Text on IFEval
June 19, 2026
Across four model families and 85 author-versus-fresh comparisons on IFEval, models acting as authors of their own drafts rejected verified-correct edits at the same rate as fresh models (gap: -5.1 pp, 95% CI [-12.9, ...]). The finding uses a deterministic verifier rather than another model as ground truth, avoiding circular evaluation.
HOW THIS AFFECTS YOU
●
researcherChallenges the assumption that self-preference bias extends to revision tasks — the deterministic verifier methodology is a useful template for cleanly isolating authorship effects.