Multiword Expressions and Named Entities in the Wiki50 Corpus.
Veronika VinczeIstván Nagy T.Gábor BerendPublished in: RANLP (2011)
Keyphrases
- named entities
- multiword
- lexical units
- natural language processing
- context sensitive
- named entity recognition
- information extraction
- co occurrence
- annotated corpus
- wordnet
- text mining
- question answering
- relation extraction
- text clustering
- part of speech
- text documents
- natural language
- language model
- genia corpus
- document representation
- machine learning
- semantic knowledge
- unsupervised learning
- knowledge discovery
- information retrieval
- data mining