Joint Learning of Policy with Unknown Temporal Constraints for Safe Reinforcement Learning.
Lunet Yifru
Ali Baheri
Published in:
CoRR (2023)
Keyphrases
reinforcement learning
temporal constraints
learning algorithm
optimal policy
temporal reasoning
action selection
actor critic
policy search
objective function
constraint propagation
consistency checking
simple temporal