Importance Sampling Placement in Off-Policy Temporal-Difference Methods.
Eric Graves, Sina Ghiassian
Published in: CoRR (2022)
Keyphrases
- importance sampling
- temporal difference methods
- Monte Carlo
- temporal difference
- function approximation
- evolutionary methods
- Markov chain
- Kalman filter
- particle filter
- policy search
- variance reduction
- TD learning
- particle filtering
- reinforcement learning problems
- function approximators
- approximate inference
- reinforcement learning
- neural network
- evolutionary algorithm