CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages.
Kyubyong ParkThomas MulcPublished in: CoRR (2019)
Keyphrases
- speech recognition
- speaker identification
- audio visual
- speaker recognition
- multi lingual
- automatic speech recognition
- vocal tract
- speaker verification
- expressive power
- speaker dependent
- database
- speech signal
- language independent
- databases
- language identification
- spoken language
- benchmark datasets
- noisy environments
- photo collections
- target language
- broadcast news
- document collections
- multi modal
- pattern recognition
- feature extraction
- english text
- speaker diarization
- feature selection
- text collections