[arXiv]score: 0.41
When Evidence Conflicts: Uncertainty and Order Effects in Retrieval-Augmented Biomedical Question Answering
May 15, 2026
Evaluates six open-weight biomedical LLMs on HealthContradict dataset under conflicting evidence conditions, measuring reliability when retrieved context is incomplete, misleading, or contradictory.
cs.CL