Visual Speech In Real Noisy Environments (VISION): A Novel Benchmark Dataset and Deep Learning-Based Baseline System.
Mandar Gogate, Kia Dashtipour, Amir Hussain
Published in: INTERSPEECH (2020)
Keyphrases
- noisy environments
- benchmark datasets
- deep learning
- visual speech
- noise reduction
- speaker identification
- unsupervised learning
- speech recognition
- speech signal
- speaker verification
- computer vision
- machine learning
- automatic speech recognition
- pattern recognition
- edge detection
- feature selection
- multiresolution
- hidden Markov models
- image processing