Learning to Answer Visual Questions from Web Videos.

Antoine Yang, Antoine Miech, Josef Sivic, Ivan Laptev, Cordelia Schmid
Published in: CoRR (2022)
Keyphrases
  • spatio temporal
  • low level
  • background knowledge
  • computer vision
  • web pages
  • high level
  • video streams