Generative Modelling of Stochastic Actions with Arbitrary Constraints in Reinforcement Learning.
Changyu Chen, Ramesha Karunasena, Thanh Hong Nguyen, Arunesh Sinha, Pradeep Varakantham
Published in: NeurIPS (2023)
Keyphrases
- reinforcement learning
- action selection
- direct policy search
- perceptual aliasing
- partially observable
- action space
- state and action spaces
- function approximation
- constraint satisfaction
- machine learning
- state space
- stochastic programming problems
- ordering constraints
- partially observable domains
- constraint programming
- dynamical systems
- transfer learning
- control policies
- partial observability
- continuous state
- continuous state spaces
- Monte Carlo
- optimal policy
- stochastic dynamic programming