Two-Stream Transformer Architecture for Long Video Understanding.

Edward FishJon WeinbrenAndrew Gilbert
Published in: CoRR (2022)
Keyphrases
  • long video
  • real time
  • data streams
  • video clips
  • web pages
  • fuzzy logic
  • image classification
  • fault diagnosis
  • d scene
  • video segments