Login / Signup
Active Speaker Detection Using Audio, Visual, and Depth Modalities: A Survey.
Siti Nur Aisyah Mohd Robi
Muhammad Atiff Zakwan Mohd Ariffin
Mohd Azri Mohd Izhar
Norulhusna Binti Ahmad
Hazilah Mad Kaidi
Published in:
IEEE Access (2024)
Keyphrases
</>
audio visual
multimodal fusion
multi modal
visual data
visual information
speaker verification
temporal context
multimedia
multi stream
emotion recognition
audio visual speech recognition
high dimensional
visual features
visual content
high dimensional data
nearest neighbor
low level
hidden markov models