Improving Patent Translation using Bilingual Term Extraction and Re-tokenization for Chinese-Japanese.
Wei Yang, Yves Lepage
Published in: WAT@COLING (2016)
Keyphrases
- term extraction
- chinese english
- multiword
- english chinese
- machine translation
- cross language information retrieval
- keyword extraction
- query translation
- statistical machine translation
- parallel corpora
- bilingual dictionaries
- automatic extraction
- machine translation system
- multilingual retrieval
- cross language
- character n grams
- wordnet
- cross language retrieval
- ontology learning
- text retrieval
- translation model
- monolingual retrieval
- comparable corpora
- information retrieval
- text mining
- context sensitive
- language model
- target language
- dublin city university
- document collections
- keywords
- semantic relations
- cross lingual
- source language
- query expansion
- n gram
- probabilistic model
- natural language
- named entities
- part of speech