q-Munchausen Reinforcement Learning.
Lingwei ZhuZheng ChenEiji UchibeTakamitsu MatsubaraPublished in: CoRR (2022)
Keyphrases
- reinforcement learning
- function approximation
- model free
- state space
- control problems
- reinforcement learning algorithms
- optimal control
- markov decision processes
- optimal policy
- direct policy search
- policy search
- learning process
- learning algorithm
- mobile robot
- machine learning
- dynamic programming
- hidden markov models
- multi agent systems
- multi agent
- stochastic approximation
- transition model
- genetic algorithm
- active exploration
- information retrieval