Deep Reinforcement Learning with Adjustments.
Hamed KhorasganiHaiyan WangChetan GuptaSusumu SeritaPublished in: CoRR (2021)
Keyphrases
- reinforcement learning
- function approximation
- state space
- markov decision processes
- database
- optimal policy
- learning algorithm
- stochastic approximation
- control problems
- temporal difference
- model free
- deep learning
- dynamic programming
- machine learning
- markov decision process
- policy search
- robot control
- belief nets
- reward function
- monte carlo
- dynamic environments
- evolutionary algorithm
- learning process
- real time