Video Prediction Models as Rewards for Reinforcement Learning.
Alejandro Escontrela, Ademi Adeniji, Wilson Yan, Ajay Jain, Xue Bin Peng, Ken Goldberg, Youngwoon Lee, Danijar Hafner, Pieter Abbeel
Published in: CoRR (2023)
Keyphrases
- reinforcement learning
- predictive model
- markov decision processes
- multimedia
- multi agent
- real time
- statistical models
- video data
- probabilistic model
- models built
- prediction accuracy
- spatial and temporal
- prediction model
- function approximation
- transition model
- markov models
- autoregressive
- video retrieval
- temporal information
- model selection
- state space
- machine learning