Login / Signup
AV16.3: An Audio-Visual Corpus for Speaker Localization and Tracking.
Guillaume Lathoud
Jean-Marc Odobez
Daniel Gatica-Perez
Published in:
MLMI (2004)
Keyphrases
</>
audio visual
multi modal
audio visual speech recognition
speaker verification
multimedia
multi stream
visual information
emotion recognition
person authentication
particle filter
audio features
visual data
high level
search engine