Login / Signup
Audio-Visual Embodied Navigation.
Changan Chen
Unnat Jain
Carl Schissler
Sebastia Vicenc Amengual Gari
Ziad Al-Halah
Vamsi Krishna Ithapu
Philip Robinson
Kristen Grauman
Published in:
CoRR (2019)
Keyphrases
</>
data sets
audio visual
multi modal
visual information
multimedia
person authentication
visual data
multi stream
temporal context
video summarization
emotion recognition
audio visual speech recognition
image data
information extraction
multiscale
high level
multimodal fusion