Audio-Visual Variational Fusion for Multi-Person Tracking with Robots.
Xavier Alameda-Pineda
Soraya Arias
Yutong Ban
Guillaume Delorme
Laurent Girin
Radu Horaud
Xiaofei Li
Bastien Mourgue
Guillaume Sarrazin
Published in:
ACM Multimedia (2019)
Keyphrases
audio visual
person authentication
multimodal fusion
multi modal
visual information
multi stream
audio visual speech recognition
visual data
information fusion
emotion recognition
multimedia
image segmentation
multi camera
temporal context
data sets
feature vectors
computer vision