Randomized Policy Learning for Continuous State and Action MDPs.
Hiteshi SharmaRahul JainPublished in: CoRR (2020)
Keyphrases
- reinforcement learning
- continuous state
- policy search
- action space
- continuous state spaces
- state action
- action selection
- markov decision processes
- learning algorithm
- continuous state and action spaces
- optimal policy
- partially observable
- state dependent
- dynamic programming
- markov decision process
- control policies
- multi agent
- continuous action