Audio-Visual Activity Guided Cross-Modal Identity Association for Active Speaker Detection.
Rahul Sharma
Shrikanth Narayanan
Published in:
CoRR (2022)
Keyphrases
audio visual
cross modal
multi modal
visual data
visual information
high dimensional
multimedia
visual similarity
audio features
image sequences
text mining
video data
image annotation
visual recognition