Audio-Visual Speech Enhancement With Selective Off-Screen Speech Extraction.
Tomoya YoshinagaKeitaro TanakaShigeo MorishimaPublished in: CoRR (2023)
Keyphrases
- audio visual
- speech enhancement
- noisy environments
- noise reduction
- speech signal
- multi modal
- signal to noise ratio
- single channel
- audio visual speech recognition
- linear prediction
- emotion recognition
- vocal tract
- visual data
- visual information
- speaker verification
- multi stream
- sound source
- multimedia
- noisy speech
- audio features
- smoothing algorithm
- multi channel
- speech recognition
- denoising
- information extraction
- multiscale