ZAEBUC-Spoken: A Multilingual Multidialectal Arabic-English Speech Corpus.
Injy HamedFadhl EryaniDavid PalfreymanNizar HabashPublished in: LREC/COLING (2024)
Keyphrases
- speech corpus
- automatic speech recognition
- broadcast news
- language resources
- cross language
- spoken document retrieval
- speech recognition
- cross lingual
- cross language ir
- speech retrieval
- speech synthesis
- cross language information retrieval
- hidden markov models
- speech signal
- text to speech
- language specific
- multilingual information retrieval
- spoken language
- digital libraries
- language independent
- text retrieval
- question answering
- document retrieval
- information access
- parallel corpus
- machine translation
- natural language
- image processing
- bilingual dictionaries
- machine translation system
- arabic language