CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages.
Kyubyong ParkThomas MulcPublished in: INTERSPEECH (2019)
Keyphrases
- speech recognition
- audio visual
- speaker recognition
- speaker identification
- speaker verification
- english text
- language identification
- multi lingual
- automatic speech recognition
- speaker diarization
- language independent
- automatic speech recognition systems
- database
- benchmark datasets
- shape representation
- expressive power
- speech signal
- gaussian mixture model
- document collections
- acoustic features
- multi modal
- databases