Speaker Activity Driven Neural Speech Extraction.
Marc Delcroix, Katerina Zmolíková, Tsubasa Ochiai, Keisuke Kinoshita, Tomohiro Nakatani
Published in: ICASSP (2021)
Keyphrases
- speech recognition
- speaker recognition
- audio visual
- automatic speech recognition
- speaker verification
- speaker identification
- speech signal
- prosodic features
- speaker dependent
- speaker diarization
- vocal tract
- noisy environments
- automatic speech recognition systems
- multi modal
- neural network
- network architecture
- speaker adaptation
- information extraction
- hidden Markov models
- speech synthesis
- bio inspired
- human activities
- automatic transcription
- data driven
- text to speech
- speech recognizer
- speech sounds
- synthesized speech
- multimedia
- acoustic features
- neural model
- automatic extraction
- vector quantization
- language model
- recognition engine
- speaker independent
- emotion recognition
- activity recognition
- phoneme recognition
- visual information
- visual features