ZAEBUC-Spoken: A Multilingual Multidialectal Arabic-English Speech Corpus.
Injy HamedFadhl EryaniDavid PalfreymanNizar HabashPublished in: CoRR (2024)
Keyphrases
- speech corpus
- automatic speech recognition
- broadcast news
- language resources
- cross language
- speech recognition
- spoken document retrieval
- cross lingual
- cross language ir
- speech synthesis
- language specific
- cross language information retrieval
- hidden markov models
- speech signal
- parallel corpus
- speech retrieval
- multilingual information retrieval
- text to speech
- machine translation
- out of vocabulary
- question answering
- spoken language
- machine translation system
- language independent
- text retrieval
- natural language
- query translation
- non stationary
- document collections
- information access
- arabic language
- text categorization
- digital libraries
- image processing