KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos.
Egor Lakomkin, Sven Magg, Cornelius Weber, Stefan Wermter
Published in: EMNLP (Demonstration) (2018)
Keyphrases
- speech recognition
- YouTube videos
- automatic speech recognition
- speech synthesis
- speech signal
- hidden Markov models
- speech recognizer
- language model
- speech processing
- pattern recognition
- speech recognition technology
- speech recognition systems
- speech recognizers
- event detection
- speaker identification
- website
- search engine
- recognition engine
- keyword spotting
- speaker independent
- noisy environments
- word error rate
- speech recognition errors
- isolated word
- cepstral coefficients
- speaker adaptation
- speaker recognition
- web pages
- speaker dependent
- acoustic models
- speech retrieval
- user interaction