Towards Intelligibility-Oriented Audio-Visual Speech Enhancement.
Tassadaq Hussain, Mandar Gogate, Kia Dashtipour, Amir Hussain
Published in: CoRR (2021)
Keyphrases
- audio visual
- speech enhancement
- signal to noise ratio
- multi modal
- noisy environments
- noise reduction
- speech signal
- speech recognition
- audio visual speech recognition
- visual data
- single channel
- multi stream
- visual information
- wiener filter
- multimedia
- vocal tract
- linear prediction
- audio features
- multi channel
- high dimensional
- automatic speech recognition
- sound source
- computer vision
- machine learning