[HUGGINGFACE] score: 0.48
Medical RAG Systems Give Different Answers Depending on Source Retrieved
May 26, 2026
In a transplant patient education setting, RAG systems grounded in different institutional handbooks produce conflicting answers to identical questions. Benchmarks that score against a single gold answer cannot detect this failure. The paper releases TransplantQA and HERO-QA to audit inter-source disagreement as a distinct evaluation axis.
HOW THIS AFFECTS YOU
● Researcher: HERO-QA and TransplantQA provide a concrete framework for evaluating source-dependence, a gap in current RAG evaluation methodology.
● Policy: Source-dependence in medical RAG exposes a compliance gap: systems can produce contradictory outputs without any single answer being flagged as incorrect.
● Health: Worth watching because source-dependent answer variance in patient-facing RAG is a direct clinical safety risk, especially across multi-author institutional corpora.
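
To make the idea of inter-source disagreement concrete, here is a minimal sketch of how it might be measured for one question. It is not the paper's metric: the function names, the source labels, and the exact-match comparison are all assumptions chosen for illustration, and a real audit would likely need a semantic or clinician-reviewed comparison rather than string equality.

```python
from itertools import combinations


def normalize(answer: str) -> str:
    """Lowercase and collapse whitespace so trivial formatting differences are ignored."""
    return " ".join(answer.lower().split())


def disagreement_rate(answers_by_source: dict[str, str]) -> float:
    """Fraction of source pairs whose normalized answers differ.

    Hypothetical measure for illustration only; exact string match is a crude
    stand-in for a real answer-equivalence check.
    """
    pairs = list(combinations(answers_by_source.values(), 2))
    if not pairs:
        return 0.0  # fewer than two sources: nothing to compare
    differing = sum(normalize(a) != normalize(b) for a, b in pairs)
    return differing / len(pairs)


# Hypothetical answers to one question, each produced by a RAG system
# grounded in a different (made-up) handbook.
answers = {
    "handbook_a": "Yes",
    "handbook_b": "yes",
    "handbook_c": "No",
}
rate = disagreement_rate(answers)
print(f"pairwise disagreement: {rate:.2f}")
```

A single-gold-answer benchmark would score each of these answers separately against one reference, whereas a pairwise measure like this one surfaces the conflict between sources directly.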