●builderYou can integrate MIRAGE as a two-channel runtime monitor to catch covert exfiltration attempts in agentic pipelines before output is produced.
●researcherIdentifies a mechanistically interpretable, architecture-agnostic subspace for encoding behavior, opening a new direction for probing agent internals.
●policyProvides a detection mechanism for a concrete agentic threat model — covert data exfiltration via encoding — that evades output-side filters.