Building a Named Entity Annotated Bilingual English-Vietnamese Corpus.
Tuan-An DaoHung-Thinh TruongLong H. B. NguyenDien DinhPublished in: KSE (2018)
Keyphrases
- annotated corpus
- named entities
- named entity recognition
- proper names
- genia corpus
- person names
- machine translation
- information extraction
- natural language processing
- relation extraction
- linguistic features
- question answering
- cross language
- manually annotated
- noun phrases
- cross lingual
- co occurrence
- parallel corpus
- named entity extraction
- text mining
- named entity recognizer
- statistical machine translation
- chinese english
- cross language information retrieval
- sentence pairs
- maximum entropy
- parallel corpora
- natural language
- word alignment
- automatic annotation
- text documents
- conditional random fields
- semi supervised
- comparable corpora
- unsupervised learning
- named entity disambiguation
- machine learning
- semantic role labeling
- query translation
- language modeling
- query expansion
- text classification