[arXiv]score: 0.11

LTLf Constraints Injected into Transformer RL Policies via DFA Regularization

June 9, 2026

A neuro-symbolic framework compiles LTLf temporal logic formulas into deterministic finite automata and integrates them as a differentiable regularization term during offline RL training with Decision or Trajectory Transformers. The approach enforces high-level temporal constraints without modifying the base architecture, making it applicable across transformer-based RL setups.

HOW THIS AFFECTS YOU

●

researcherThe DFA-to-differentiable-signal compilation is a clean mechanism for injecting formal temporal constraints into sequence-model RL — relevant if you work on constrained or safety-critical RL policies.

read original ↗arxiv.org

← back to feed