JTubeSpeech: corpus of Japanese speech collected from YouTube for speech recognition and speaker verification.
Shinnosuke TakamichiLudwig KürzingerTakaaki SaekiSayaka ShiotaShinji WatanabePublished in: CoRR (2021)
Keyphrases
- speech recognition
- speaker verification
- noisy environments
- speaker recognition
- speech signal
- automatic speech recognition
- speaker identification
- speech synthesis
- speech recognizer
- language model
- hidden markov models
- speech enhancement
- pattern recognition
- speech recognition systems
- speech processing
- conversational speech
- speech recognition technology
- acoustic features
- language identification
- speaker dependent
- speaker independent
- spontaneous speech
- broadcast news
- audio visual
- action recognition
- machine learning
- information retrieval
- neural network
- speaker adaptation
- feature extraction
- speaker diarization
- maximum likelihood
- noise reduction
- emotion recognition