Tracking as Online Decision-Making: Learning a Policy from Streaming Videos with Reinforcement Learning.

James Steven Supancic III Deva Ramanan

Published in: ICCV (2017)

Keyphrases

reinforcement learning
online learning
action selection
learning process
decision making
learning algorithm
real time
learning problems
learning tasks
function approximation
supervised learning
partially observable environments
markov decision processes
optimal policy
rl algorithms
sequential decision making
actor critic
video frames
video sequences
multimedia
markov decision process
function approximators
temporal difference learning
reinforcement learning problems
balancing exploration and exploitation