Look&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement.
Junwen Xiong
Yu Zhou
Peng Zhang
Lei Xie
Wei Huang
Yufei Zha
Published in:
CoRR (2022)
Keyphrases
multi modal
audio visual
multi modality
prior knowledge
multimedia
high dimensional
image compression
speech recognition
cross modal