Login / Signup
Large-scale multilingual audio visual dubbing.
Yi Yang
Brendan Shillingford
Yannis M. Assael
Miaosen Wang
Wendi Liu
Yutian Chen
Yu Zhang
Eren Sezener
Luis C. Cobo
Misha Denil
Yusuf Aytar
Nando de Freitas
Published in:
CoRR (2020)
Keyphrases
</>
audio visual
multi modal
visual information
multimedia
temporal context
multi stream
visual data
audio visual speech recognition
knowledge base
hidden markov models
emotion recognition
person authentication
data sets
databases
high dimensional
human body