[arXiv]score: 0.11

LLMs Show No Detectable Self-Preference When Revising Their Own Text on IFEval

June 19, 2026

Across four model families and 85 author-versus-fresh comparisons on IFEval, models acting as authors of their own drafts rejected verified-correct edits at the same rate as fresh models (gap: -5.1 pp, 95% CI [-12.9, ...]). The finding uses a deterministic verifier rather than another model as ground truth, avoiding circular evaluation.

HOW THIS AFFECTS YOU

●

researcherChallenges the assumption that self-preference bias extends to revision tasks — the deterministic verifier methodology is a useful template for cleanly isolating authorship effects.

read original ↗arxiv.org

← back to feed