Building a Parallel Multilingual Corpus (Arabic-Spanish-English).
Doaa Samy, Antonio Moreno-Sandoval, José María Guirao, Enrique Alfonseca
Published in: LREC (2006)
Keyphrases
- language identification
- machine translation system
- parallel corpus
- cross lingual
- mono lingual
- machine translation
- cross language
- wide coverage
- cross language information retrieval
- statistical machine translation
- unknown words
- spanish language
- comparable corpora
- question answering
- indian languages
- arabic language
- open domain
- parallel corpora
- query translation
- chinese english
- sentence pairs
- language specific
- link grammar
- language independent
- cross language ir
- cross lingual information retrieval
- person names
- pronominal anaphora
- multilingual information retrieval
- word alignment
- broad coverage
- speaker identification
- language resources
- qa clef
- document images
- english words
- multiword
- natural language
- document collections
- word sense disambiguation
- information access
- target language
- word pairs
- training corpus
- natural language processing
- digital libraries
- morphological analysis
- information retrieval