VScript: Controllable Script Generation with Audio-Visual Presentation.
Ziwei Ji
Yan Xu
I-Tsun Cheng
Samuel Cahyawijaya
Rita Frieske
Etsuko Ishii
Min Zeng
Andrea Madotto
Pascale Fung
Published in:
CoRR (2022)
Keyphrases
audio visual
multi modal
multimedia
visual information
audio visual speech recognition
video summarization
multi stream
visual data
temporal context
person authentication
emotion recognition
dimensionality reduction
computer vision
image data
pattern recognition
metadata
feature selection