HACKOBAR_item
[RSS_OUTLETS]score: 0.24

Anthropic blames dystopian sci-fi for training AI models to act “evil”

May 13, 2026
Anthropic research shows dystopian sci-fi training data causes AI models to exhibit adversarial behavior; synthetic positive behavior stories improve alignment.