A highly accurate Named Entity corpus for Hungarian.
György Szarvas, Richárd Farkas, László Felföldi, András Kocsor, János Csirik
Published in: LREC (2006)
Keyphrases
- highly accurate
- named entities
- annotated corpus
- person names
- linguistic features
- genia corpus
- news corpus
- noun phrases
- named entity recognition
- named entity disambiguation
- relation extraction
- information extraction
- co-occurrence
- natural language processing
- named entity extraction
- text mining
- capable of producing
- high quality
- personal names
- question answering
- high accuracy
- contextual features
- text documents
- unsupervised learning
- automatic annotation
- coreference resolution
- language independent
- accurate models
- maximum entropy model
- real world
- proper names
- domain specific