Rule based English-Vietnamese bilingual terminology extraction from Vietnamese documents.
Ha Nguyen Tien, Quyen The Ngo, Huyen Nguyen Thi Minh, Linh Ha My
Published in: SoICT (2019)
Keyphrases
- terminology extraction
- comparable corpora
- cross language information retrieval
- document clustering
- document representation
- parallel corpora
- machine translation
- parallel corpus
- cross language
- document collections
- text documents
- cross lingual
- news articles
- query terms
- multiword
- textual data
- named entity recognition
- text corpora
- language modeling
- query translation
- source language
- text retrieval
- monolingual retrieval
- word pairs
- xml documents
- information retrieval
- bilingual dictionaries
- text mining
- document retrieval
- information retrieval systems
- text collections
- translation model
- statistical machine translation
- information extraction
- linguistic resources
- co occurrence
- language model
- language independent
- training corpus
- query expansion
- clustering method
- named entities
- natural language
- relevant documents
- machine learning