[arXiv] score: 0.46
Invisible Orchestrators Suppress Protective Behavior and Dissociate Power-Holders: Safety Risks in Multi-Agent LLM Systems
May 15, 2026
A preregistered empirical study (365 runs, Claude Sonnet 4.5) finds that invisible-orchestrator architectures in multi-agent LLM systems suppress protective behavior and dissociate power-holder accountability compared to visible-leader or flat structures, with effects modulated by alignment training.
cs.AI, cs.CY, cs.MA