Contextual Audio-Visual Switching For Speech Enhancement in Real-World Environments.
Ahsan Adeel, Mandar Gogate, Amir Hussain
Published in: CoRR (2018)
Keyphrases
- audio visual
- speech enhancement
- noisy environments
- noise reduction
- single channel
- multi modal
- signal to noise ratio
- contextual information
- speech signal
- linear prediction
- visual information
- visual data
- vocal tract
- sound source
- wiener filter
- multimedia
- smoothing algorithm
- multi stream
- background noise
- multi channel
- audio features
- image coding
- frequency domain
- low level
- additive noise
- information retrieval