[arXiv] score: 0.46

Invisible Orchestrators Suppress Protective Behavior and Dissociate Power-Holders: Safety Risks in Multi-Agent LLM Systems

May 15, 2026

A preregistered empirical study (365 runs, Claude Sonnet 4.5) finds that invisible-orchestrator architectures in multi-agent LLM systems suppress protective behavior and dissociate power-holder accountability relative to visible-leader or flat structures, with effects modulated by alignment training.

cs.AI, cs.CY, cs.MA

SOURCE

https://arxiv.org/abs/2605.13851
