Deep Ordinal Reinforcement Learning.

Alexander Zap Tobias Joppen Johannes Fürnkranz

Published in: CoRR (2019)

Keyphrases

reinforcement learning
function approximation
model free
reinforcement learning algorithms
temporal difference
state space
markov decision processes
learning algorithm
multi agent
learning process
multi agent reinforcement learning
action selection
stochastic approximation
reinforcement learning methods
neural network
optimal policy
semi supervised
learning problems
policy search
direct policy search
control problems
partially observable
supervised learning
dynamic programming
hidden markov models
information retrieval