Deep Ordinal Reinforcement Learning.
Alexander Zap, Tobias Joppen, Johannes Fürnkranz
Published in: CoRR (2019)
Keyphrases
- reinforcement learning
- function approximation
- model-free
- reinforcement learning algorithms
- temporal difference
- state space
- Markov decision processes
- learning algorithm
- multi-agent
- learning process
- multi-agent reinforcement learning
- action selection
- stochastic approximation
- reinforcement learning methods
- neural network
- optimal policy
- semi-supervised
- learning problems
- policy search
- direct policy search
- control problems
- partially observable
- supervised learning
- dynamic programming
- hidden Markov models
- information retrieval