Login / Signup
Speaker Adaptive Audio-Visual Fusion for the Open-Vocabulary Section of AVICAR.
Leda Sari
Mark Hasegawa-Johnson
Kumaran S
Georg Stemmer
Krishnakumar N. Nair
Published in:
INTERSPEECH (2018)
Keyphrases
</>
audio visual
multimodal fusion
multi modal
person authentication
speaker verification
visual information
multimedia
emotion recognition
temporal context
visual data
multi stream
audio features
keywords
pattern recognition
visual features