Tracking as Online Decision-Making: Learning a Policy from Streaming Videos with Reinforcement Learning.
James Steven Supancic IIIDeva RamananPublished in: ICCV (2017)
Keyphrases
- reinforcement learning
- online learning
- action selection
- learning process
- decision making
- learning algorithm
- real time
- learning problems
- learning tasks
- function approximation
- supervised learning
- partially observable environments
- markov decision processes
- optimal policy
- rl algorithms
- sequential decision making
- actor critic
- video frames
- video sequences
- multimedia
- markov decision process
- function approximators
- temporal difference learning
- reinforcement learning problems
- balancing exploration and exploitation