Comparison of forced-alignment speech recognition and humans for generating reference VAD.
Ivan KraljevskiZheng-Hua TanMaria Paola BissiriPublished in: INTERSPEECH (2015)
Keyphrases
- speech recognition
- noisy environments
- voice activity detection
- hidden markov models
- speech signal
- language model
- speech synthesis
- speech processing
- speech recognizer
- pattern recognition
- automatic speech recognition
- handwriting recognition
- speech recognition technology
- speech recognition systems
- speaker recognition
- speaker identification
- speaker independent
- speech understanding
- digit recognition
- speaker dependent
- keyword spotting
- speech recognizers
- isolated word