Login / Signup
Look at What I'm Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos.
Reuben Tan
Bryan A. Plummer
Kate Saenko
Hailin Jin
Bryan Russell
Published in:
NeurIPS (2021)
Keyphrases
</>
instructional videos
spatial data
spatial information
content analysis
feature extraction
spatial relationships