Login / Signup
Contrastive Audio-Visual Masked Autoencoder.
Yuan Gong
Andrew Rouditchenko
Alexander H. Liu
David Harwath
Leonid Karlinsky
Hilde Kuehne
James R. Glass
Published in:
ICLR (2023)
Keyphrases
</>
audio visual
multi modal
visual information
visual data
temporal context
multi stream
multimedia
person authentication
multimodal fusion
video summarization
emotion recognition
databases
data sets
visual features
data processing
spatio temporal
three dimensional
audio visual speech recognition
audio visual content