Login / Signup
Improving Visual Speech Enhancement Network by Learning Audio-visual Affinity with Multi-head Attention.
Xinmeng Xu
Yang Wang
Jie Jia
Binbin Chen
Dejun Li
Published in:
INTERSPEECH (2022)
Keyphrases
</>
audio visual
multi modal
visual information
prior knowledge
image processing
visual data
computer vision
visual features
multi stream
speech enhancement