An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and Separation.
Daniel Michelsanti
Zheng-Hua Tan
Shi-Xiong Zhang
Yong Xu
Meng Yu
Dong Yu
Jesper Jensen
Published in:
IEEE/ACM Trans. Audio Speech Lang. Process. (2021)
Keyphrases
audio visual
deep learning
speech enhancement
sound source
single channel
noisy environments
noise reduction
multi modal
signal to noise ratio
unsupervised learning
visual information
visual data
speech signal
machine learning
multimedia
edge detection
mental models
speech recognition
data sets
hidden markov models