Learning to Answer Visual Questions from Web Videos.
Antoine Yang
Antoine Miech
Josef Sivic
Ivan Laptev
Cordelia Schmid
Published in: CoRR (2022)
Keyphrases
spatio-temporal
low level
background knowledge
computer vision
web pages
high level
video streams