Generative Modelling of Stochastic Actions with Arbitrary Constraints in Reinforcement Learning.
Changyu ChenRamesha KarunasenaThanh Hong NguyenArunesh SinhaPradeep VarakanthamPublished in: CoRR (2023)
Keyphrases
- reinforcement learning
- direct policy search
- action selection
- action space
- state space
- constraint satisfaction
- generative model
- perceptual aliasing
- function approximation
- partially observable
- stochastic programming problems
- stochastic approximation
- control policies
- partial observability
- indirect effects
- state and action spaces
- markov decision processes
- reward function
- temporal difference
- discriminative learning
- situation calculus
- human actions
- multiagent reinforcement learning
- partially observable domains
- monte carlo
- learning process