Crowd-Sourced Speech Corpora for Javanese, Sundanese, Sinhala, Nepali, and Bangladeshi Bengali.
Oddur KjartanssonSupheakmungkol SarinKnot PipatsrisawatMartin JanscheLinne HaPublished in: SLTU (2018)
Keyphrases
- crowd sourced
- statistical machine translation
- news corpus
- crowd sourcing
- speech recognition
- named entities
- natural language processing
- speech signal
- active learning
- object retrieval
- named entity recognition
- machine translation
- language model
- data analysis
- social networking
- cross language
- multiscale
- image classification
- speaker identification
- data model
- natural language