Audio-Visual Speech Enhancement with Selective Off-Screen Speech Extraction.
Tomoya YoshinagaKeitaro TanakaShigeo MorishimaPublished in: EUSIPCO (2023)
Keyphrases
- audio visual
- speech enhancement
- multi modal
- noisy environments
- speech signal
- noise reduction
- signal to noise ratio
- single channel
- visual information
- emotion recognition
- multi stream
- audio features
- visual data
- audio visual speech recognition
- sound source
- noisy speech
- multimedia
- linear prediction
- background noise
- speaker verification
- vocal tract
- smoothing algorithm
- speech recognition
- information extraction
- edge detection
- computer vision
- automatic speech recognition
- metadata
- high level
- speaker identification
- wiener filter
- hidden markov models