[arXiv]score: 0.21

HyPOLE Framework for Hyperproperty-Guided MARL

July 1, 2026

HyPOLE uses HyperLTL temporal logic to provide formal specifications for guiding multi-agent reinforcement learning under partial observability. The framework integrates Centralized Training for Decentralized Execution (CTDE) to synthesize policies that outperform standard reward-shaping baselines on SMAC and WildFire benchmarks.

HOW THIS AFFECTS YOU

●

researcherThis offers a mathematically rigorous alternative to reward shaping for complex multi-agent coordination tasks.

read original ↗arxiv.org

← back to feed