Preparatory Work on Automatic Extraction of Bilingual Multi-Word Units from Parallel Corpora.
Boxing ChenLimin DuPublished in: Int. J. Comput. Linguistics Chin. Lang. Process. (2003)
Keyphrases
- automatic extraction
- parallel corpora
- multiword
- statistical machine translation
- bilingual dictionaries
- biomedical literature
- term extraction
- machine translation
- cross lingual
- cross language information retrieval
- context sensitive
- comparable corpora
- relation extraction
- language independent
- labor intensive
- machine translation system
- language model
- word pairs
- text clustering
- natural language text
- wikipedia articles
- query translation
- semantic knowledge
- search engine
- part of speech
- cross language
- text mining