Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism.
Jisi ZhangCatalin ZorilaRama DoddipatlaJon BarkerPublished in: ICASSP (2021)
Keyphrases
- spatial information
- speech recognition
- speaker recognition
- automatic speech recognition
- audio visual
- spatial distribution
- speaker verification
- speaker identification
- spatial relationships
- temporal information
- local binary pattern
- intensity values
- frequency domain
- automatic speech recognition systems
- prosodic features
- speech signal
- spatial resolution
- spatial relations
- speaker diarization
- text to speech
- hidden markov models
- region connection calculus
- audio stream
- speaker dependent
- speech recognizer
- speaker independent
- broadcast news
- speech synthesis
- speaker adaptation
- topological information
- acoustic features
- spatial arrangement