Login / Signup
AlignNet: A Unifying Approach to Audio-Visual Alignment.
Jianren Wang
Zhaoyuan Fang
Hang Zhao
Published in:
WACV (2020)
Keyphrases
</>
audio visual
multi modal
visual information
multimedia
multi stream
emotion recognition
video summarization
person authentication
visual data
temporal context
audio visual speech recognition
databases
nearest neighbor
natural language processing
spatio temporal
visual content