Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments.
Giovanni MorroneLuca PasaVadim TikhanoffSonia BergamaschiLuciano FadigaLeonardo BadinoPublished in: CoRR (2018)
Keyphrases
- audio visual
- digit recognition
- speech enhancement
- speaker independent
- noisy environments
- speech recognition
- multi modal
- speech signal
- sound source
- speaker identification
- visual information
- single channel
- speaker verification
- noise reduction
- emotion recognition
- visual data
- multimedia
- signal to noise ratio
- vocal tract
- image processing
- audio features
- co occurrence
- machine learning
- feature selection
- image classification