Login / Signup
Low-Latency Online Streaming VideoQA Using Audio-Visual Transformers.
Chiori Hori
Takaaki Hori
Jonathan Le Roux
Published in:
INTERSPEECH (2022)
Keyphrases
</>
audio visual
low latency
real time
stream processing
continuous query processing
multi modal
visual information
high throughput
highly efficient
multimedia
multi stream
visual data
high speed
virtual machine
data streams
audio visual speech recognition
database