[arXiv]score: 0.24

Anchored Confabulation: Partial Evidence Non-Monotonically Amplifies Confident Hallucination in LLMs

April 30, 2026

Anchored Confabulation: Partial Evidence Non-Monotonically Amplifies Confident Hallucination in LLMs Researchers from arXiv introduce anchored confabulation, a newly characterized failure mode where providing one confirmed intermediate reasoning step non-monotonically spikes confident hallucination rates in LLMs before full evidence suppresses them. Parametric Hallucination Confidence scores follow a non-linear arc of 0.613 to 0.656 to 0.595 to 0.536 across a causal injection experiment with N=160, with capability scaling confirmed across five model families at Spearman rho=0.900. The Anchoring Threshold Law k*(n)=floor(n/3) predicts PHC amplification by reasoning hop depth with four confirmed predictions, and a LearnedRouter exploiting PHC closes 81.1% of the oracle performance gap on 1,800 RAG queries at macro F1=0.426. RAG system architects and reasoning pipeline engineers must audit partial-evidence injection points immediately, as this finding reframes partial grounding not as a safety buffer but as a potential

cs.CL

SOURCE

https://arxiv.org/abs/2604.25931

← back to feed