A First Empirical Study of Emphatic Temporal Difference Learning.
Sina Ghiassian, Banafsheh Rafiee, Richard S. Sutton
Published in: CoRR (2017)
Keyphrases
- empirical studies
- temporal difference learning
- function approximation
- fixed point
- reinforcement learning
- evaluation function
- game playing
- empirical analysis
- temporal difference
- approximate value iteration
- Monte Carlo
- reinforcement learning algorithms
- Markov decision process
- cost function
- collaborative learning
- function approximators