A new hybrid metric for verifying parallel corpora of Arabic-English.
Saad AlkahtaniWei LiuWilliam John TeahanPublished in: CoRR (2015)
Keyphrases
- parallel corpora
- machine translation
- cross language information retrieval
- english chinese
- cross lingual
- comparable corpora
- statistical machine translation
- cross language
- language independent
- machine translation system
- language resources
- bilingual dictionaries
- cross lingual information retrieval
- sentence pairs
- labor intensive
- query translation
- sentence level
- parallel corpus
- word pairs
- wikipedia articles
- language identification
- word level
- natural language processing
- news articles
- document collections
- text mining