LTLf Constraints Injected into Transformer RL Policies via DFA Regularization
June 9, 2026
A neuro-symbolic framework compiles formulas in LTLf (linear temporal logic over finite traces) into deterministic finite automata (DFAs) and integrates them as a differentiable regularization term during offline RL training with Decision Transformers or Trajectory Transformers. The approach enforces high-level temporal constraints without modifying the base architecture, so it applies across transformer-based RL setups.
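To make the idea concrete, here is a minimal sketch of how a DFA can be turned into a smooth penalty. It is not the framework's actual implementation, which this summary does not describe. Everything named here is an assumption for illustration: the example formula G(!unsafe) & F(goal), the three-symbol alphabet, the hand-built DFA, the negative log acceptance penalty, and the weight `lam`. The sketch propagates a probability distribution over DFA states using per-step symbol probabilities (assumed to come from the policy's outputs via some labeling function) and penalizes low probability of ending in an accepting state.

```python
import math

# Hypothetical DFA for the LTLf formula G(!unsafe) & F(goal),
# hand-built for illustration over a simplified alphabet of
# mutually exclusive step labels.
# States: 0 = goal not yet seen, 1 = goal seen (accepting), 2 = violated (sink).
SYMBOLS = ("safe", "goal", "unsafe")
TRANSITIONS = {
    0: {"safe": 0, "goal": 1, "unsafe": 2},
    1: {"safe": 1, "goal": 1, "unsafe": 2},
    2: {"safe": 2, "goal": 2, "unsafe": 2},
}
ACCEPTING = {1}
INITIAL = 0


def dfa_accepts(trace):
    """Run the DFA on a concrete symbol sequence (hard, non-differentiable)."""
    state = INITIAL
    for sym in trace:
        state = TRANSITIONS[state][sym]
    return state in ACCEPTING


def soft_acceptance(symbol_probs):
    """Probability of ending in an accepting state.

    symbol_probs is a list of per-step probability vectors ordered like
    SYMBOLS. Only sums and products are used, so the same computation is
    differentiable when written in an autodiff framework.
    """
    belief = {s: 0.0 for s in TRANSITIONS}
    belief[INITIAL] = 1.0
    for probs in symbol_probs:
        nxt = {s: 0.0 for s in TRANSITIONS}
        for state, mass in belief.items():
            for sym, p in zip(SYMBOLS, probs):
                nxt[TRANSITIONS[state][sym]] += mass * p
        belief = nxt
    return sum(belief[s] for s in ACCEPTING)


def dfa_regularizer(symbol_probs, eps=1e-8):
    """Assumed penalty: negative log of the soft acceptance probability."""
    return -math.log(soft_acceptance(symbol_probs) + eps)


def total_loss(base_loss, symbol_probs, lam=0.1):
    """Base sequence-modeling loss plus the weighted DFA penalty (lam is assumed)."""
    return base_loss + lam * dfa_regularizer(symbol_probs)


if __name__ == "__main__":
    compliant = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]   # safe, then goal
    risky = [[0.5, 0.0, 0.5], [0.0, 1.0, 0.0]]       # 50% chance of unsafe first
    print(total_loss(1.0, compliant), total_loss(1.0, risky))
```

Because the penalty is added to the loss rather than wired into the network, the transformer itself is unchanged, which matches the architecture-agnostic claim above. In a real training loop the same recurrence would be written with tensors so gradients flow back to the policy.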
HOW THIS AFFECTS YOU
● Researcher: The DFA-to-differentiable-signal compilation is a clean mechanism for injecting formal temporal constraints into sequence-model RL, relevant if you work on constrained or safety-critical RL policies.