[arXiv] score: 0.17
TRACE Compresses Agent Trajectories for Long-Horizon Safety Detection
June 2, 2026
TRACE reframes long-horizon agent safety as trajectory-level evidence compression using a Compressor-Reader architecture: the Compressor encodes full trajectories into latent evidence states, which the Reader then uses as safety references. Evaluated on ASSEBench, Pre-Ex-Bench, and R-Judge, TRACE improves accuracy over strong baselines by up to 12.6% across all tested backbones.
cs.AI
HOW THIS AFFECTS YOU
● Builder: If you're running long-horizon LLM agents in production, TRACE's trajectory-level supervision approach could improve safety detection where turn-level moderators miss compositional risks.
● Researcher: The Compressor-Reader design offers a concrete architectural pattern for aggregating sparse, delayed risk signals across long agent trajectories, and is worth benchmarking against your own safety detectors.
● Policy: Worth watching because multi-step agent safety failures that evade local moderation are a key governance gap, and this work provides a measurable detection framework.