An Approach to Construct a Named Entity Annotated English-Vietnamese Bilingual Corpus.
Long H. B. Nguyen, Dien Dinh, Phuoc Tran
Published in: ACM Trans. Asian Low Resour. Lang. Inf. Process. (2016)
Keyphrases
- annotated corpus
- named entities
- proper names
- named entity recognition
- relation extraction
- GENIA corpus
- machine translation
- manually annotated
- information extraction
- person names
- natural language processing
- cross language
- question answering
- linguistic features
- cross lingual
- co occurrence
- named entity extraction
- text mining
- cross language information retrieval
- noun phrases
- named entity recognizer
- text documents
- parallel corpus
- statistical machine translation
- word alignment
- natural language
- sentence pairs
- target language
- conditional random fields
- maximum entropy
- parallel corpora
- query translation
- unsupervised learning
- data mining
- Chinese English
- document retrieval
- multiword
- semantic role labeling
- parse tree