●builderYou should implement prompt canonicalization or ensemble consistency checks in any healthcare LLM pipeline to reduce sensitivity to phrasing variation.
●policyThis provides empirical grounding for requiring adversarial robustness testing as part of clinical AI validation and regulatory review.
●healthMedical-specific LLMs are not intrinsically safer than general-purpose ones — prompt sensitivity analysis should be a required step before any clinical deployment.