On Word Error Rate Definitions and their Efficient Computation for Multi-Speaker Speech Recognition Systems.
Thilo von NeumannChristoph BöddekerKeisuke KinoshitaMarc DelcroixReinhold Haeb-UmbachPublished in: CoRR (2022)
Keyphrases
- speech recognition
- efficient computation
- word error rate
- speech recognition systems
- automatic speech recognition
- speech recognizer
- language model
- handwriting recognition
- pattern recognition
- speech signal
- speaker identification
- computational efficiency
- hidden markov models
- noisy environments
- speaker diarization
- broadcast news
- speaker recognition
- mel frequency cepstral coefficients
- language independent
- neural network
- n gram
- error rate
- collaborative filtering
- feature extraction
- image processing
- machine learning