Cross Script Hindi English NER Corpus from Wikipedia.
Mohd Zeeshan AnsariTanvir AhmadAli Md. ArshadPublished in: CoRR (2018)
Keyphrases
- named entity recognition
- named entities
- person names
- named entity disambiguation
- proper names
- annotated corpus
- named entity recognizer
- natural language processing
- language identification
- computing semantic relatedness
- information extraction
- noun phrases
- broad coverage
- indian languages
- relation extraction
- pos tagging
- co occurrence
- world knowledge
- statistical machine translation
- contextual features
- question answering
- text summarization
- machine translation
- open domain
- maximum entropy
- wordnet
- text mining
- english words
- natural language
- chinese named entity recognition
- semantic relations
- conditional random fields
- unsupervised learning
- link grammar
- parallel corpus
- wikipedia articles
- word sense
- machine learning
- natural language text
- text documents
- knowledge representation
- document images