Minimax-Bayes Reinforcement Learning.
Thomas Kleine Buening, Christos Dimitrakakis, Hannes Eriksson, Divya Grover, Emilio Jorge
Published in: CoRR (2023)
Keyphrases
- reinforcement learning
- function approximation
- decision trees
- temporal difference
- support vector
- evaluation function
- state space
- markov decision processes
- robotic control
- temporal difference learning
- alpha beta
- reinforcement learning algorithms
- optimal policy
- worst case
- support vector machine
- neural network
- transfer learning
- policy search
- multi agent reinforcement learning
- model free
- machine learning
- learning algorithm
- control policy
- bayes classifier
- bayesian networks
- neyman pearson
- reinforcement learning methods
- markov decision process
- supervised learning
- multi agent
- optimal control
- lower bound
- np hard
- learning problems
- upper bound