Login / Signup
Deep-learning-based audio-visual speech enhancement in presence of Lombard effect.
Daniel Michelsanti
Zheng-Hua Tan
Sigurdur Sigurdsson
Jesper Jensen
Published in:
Speech Commun. (2019)
Keyphrases
</>
audio visual
deep learning
speech enhancement
multi modal
unsupervised learning
visual information
noisy environments
machine learning
visual data
multimedia
noise reduction
signal to noise ratio
single channel
text classification
image features
speech signal
mental models
object recognition
data mining