Large Vocabulary Read Speech Corpora for Four Ethiopian Languages: Amharic, Tigrigna, Oromo and Wolaytta.
Solomon Teferra Abate, Martha Yifiru Tachbelie, Michael Melese, Hafte Abera, Tewodros Abebe, Wondwossen Mulugeta, Yaregal Assabie, Million Meshesha, Solomon Afnafu, Binyam Ephrem Seyoum
Published in: LREC (2020)
Keyphrases
- speech recognition
- spoken language
- speech recognizer
- continuous speech recognition
- multilingual
- linguistic resources
- speech recognition systems
- English text
- statistical machine translation
- speech signal
- speech synthesis
- natural language processing
- language independent
- cross language retrieval
- sign language recognition
- expressive power
- speaker adaptation
- comparable corpora
- language model
- speaker independent
- hidden Markov models
- parallel corpora
- pattern recognition
- audio visual
- automatic speech recognition
- cross language information retrieval
- text summarization
- text to speech
- target language
- sign language
- English language
- speaker recognition
- Chinese-English
- news items
- vocal tract
- language modeling
- machine translation
- query expansion
- information retrieval systems