Generalized Emphatic Temporal Difference Learning: Bias-Variance Analysis.
Assaf HallakAviv TamarRémi MunosShie MannorPublished in: CoRR (2015)
Keyphrases
- temporal difference learning
- fixed point
- function approximation
- evaluation function
- reinforcement learning
- bias variance analysis
- game playing
- temporal difference
- markov decision process
- learning algorithm
- monte carlo
- ensemble methods
- reinforcement learning algorithms
- search space
- machine learning
- least squares
- trade off
- policy iteration
- function approximators