Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis.
Karren YangDejan MarkovicSteven KrennVasu AgrawalAlexander RichardPublished in: CVPR (2022)
Keyphrases
- audio visual
- multi modal
- speech enhancement
- multi stream
- visual information
- multimedia
- emotion recognition
- speech signal
- speaker verification
- audio features
- noisy environments
- visual data
- single channel
- audio visual speech recognition
- noise reduction
- metadata
- multiscale
- vocal tract
- linear prediction
- signal to noise ratio
- feature space
- image sequences