●researcherCross-model attribution divergence is a practical diagnostic for identifying where LLMs fail on structured tasks without relying on self-reported confidence.
●policyQuantified evidence that LLM confidence scores are non-informative on structured clinical tasks strengthens the case for external calibration requirements in high-stakes deployments.
●healthWorth watching because deploying LLMs on clinical tabular data with confidence-based safeguards is unreliable — verbalized confidence does not track prediction quality.