Towards Pose-Invariant Audio-Visual Speech Enhancement in the Wild for Next-Generation Multi-Modal Hearing Aids.
Mandar Gogate, Kia Dashtipour, Amir Hussain
Published in: ICASSP Workshops (2023)
Keyphrases
- audio visual
- multi modal
- hearing aids
- speech enhancement
- noise reduction
- speech signal
- noisy environments
- signal to noise ratio
- single channel
- linear prediction
- speech recognition
- wiener filter
- multi channel
- emotion recognition
- sound source
- edge detection
- audio features
- vocal tract
- high dimensional
- principal component analysis
- information retrieval
- feature selection