Login / Signup
Contrastive Audio-Visual Masked Autoencoder.
Yuan Gong
Andrew Rouditchenko
Alexander H. Liu
David Harwath
Leonid Karlinsky
Hilde Kuehne
James R. Glass
Published in:
CoRR (2022)
Keyphrases
</>
audio visual
multi modal
visual information
temporal context
emotion recognition
multi stream
person authentication
visual data
video summarization
multimedia
data sets
audio visual content
audio visual speech recognition
image sequences
human body
knn
pattern recognition
multimodal fusion
machine learning