Simultaneous Speech Recognition and Speaker Diarization for Monaural Dialogue Recordings with Target-Speaker Acoustic Models.
Naoyuki KandaShota HoriguchiYusuke FujitaYawen XueKenji NagamatsuShinji WatanabePublished in: CoRR (2019)
Keyphrases
- speech recognition
- speaker diarization
- acoustic models
- automatic speech recognition
- speaker independent
- speech recognizer
- broadcast news
- hidden markov models
- language model
- speech signal
- speaker identification
- pattern recognition
- acoustic features
- dialogue system
- word error rate
- natural language
- spoken language
- noisy environments
- speech recognition systems
- image processing