Real-time multilingual speech recognition and speaker diarization system based on Whisper segmentation.
Ke-Ming LyuRen-yuan LyuHsien-Tsung ChangPublished in: PeerJ Comput. Sci. (2024)
Keyphrases
- speaker diarization
- speech recognition
- bayesian information criterion
- hidden markov models
- pattern recognition
- image segmentation
- automatic speech recognition
- language model
- speaker identification
- speech signal
- handwriting recognition
- noisy environments
- speech recognition systems
- machine learning
- computer vision
- broadcast news