●researcherThe lack of controlled A/B comparisons against Opus means published capability claims for Mythos/Fable should be treated as anecdotal until replicated under consistent conditions.
●policyUnverified jailbreak capability claims complicate risk assessments — absence of rigorous baselines makes it unclear whether these represent genuine safety regressions.