Login / Signup
A multi-channel/multi-speaker interactive 3D audio-visual speech corpus in Mandarin.
Jun Yu
Rongfeng Su
Lan Wang
Wenpeng Zhou
Published in:
ISCSLP (2016)
Keyphrases
</>
audio visual
multi channel
emotion recognition
speech corpus
multi modal
visual information
visual data
spoken document retrieval
speaker verification
multimedia
multi stream
automatic speech recognition
speech recognition
speech synthesis
feature selection
image data
low level