Mining Large-scale Parallel Corpora from Multilingual Patents: An English-Chinese example and its application to SMT.
Bin LuBenjamin Ka-Yin T'souTao JiangOi Yee KwongJingbo ZhuPublished in: CIPS-SIGHAN (2010)
Keyphrases
- parallel corpora
- english chinese
- machine translation system
- cross language information retrieval
- statistical machine translation
- cross lingual
- language independent
- machine translation
- cross language
- bilingual dictionaries
- parallel corpus
- query translation
- word alignment
- information retrieval
- translation model
- bi directional
- text classification
- digital libraries
- language modeling
- labor intensive
- question answering
- text mining
- natural language processing
- out of vocabulary
- information access
- wikipedia articles
- word pairs
- statistical model