MirasVoice: A bilingual (English-Persian) speech corpus.
Amir Vaheb, Ali Janalizadeh Choobbasti, Mahdi Mortazavi, Saeid Safavi, Behnam Sabeti
Published in: LREC (2018)
Keyphrases
- cross lingual
- speech corpus
- machine translation
- cross language
- spoken document retrieval
- text retrieval
- parallel corpus
- chinese english
- cross language information retrieval
- query translation
- parallel corpora
- automatic speech recognition
- english chinese
- language resources
- speech synthesis
- language independent
- word sense disambiguation
- comparable corpora
- word alignment
- text classification
- target language
- cross language retrieval
- statistical machine translation
- machine translation system
- broadcast news
- language modeling
- source language
- text to speech
- sentence pairs
- document retrieval
- multiword
- bilingual dictionaries
- news articles
- document collections
- indian languages
- information access
- natural language processing
- information extraction
- natural language
- word segmentation
- feature selection
- question answering
- n gram
- speech recognition
- retrieval model