NESM: a Named Entity based Proximity Measure for Multilingual News Clustering.
Soto Montalvo, Víctor Fresno, Raquel Martínez-Unanue
Published in: Proces. del Leng. Natural (2012)
Keyphrases
- named entities
- proximity measures
- random walk
- named entity recognition
- named entity extraction
- unsupervised learning
- information extraction
- relation extraction
- distance measure
- question answering
- similarity measure
- co-occurrence
- natural language processing
- link prediction
- text mining
- clustering algorithm
- neighborhood structure
- k-means
- annotated corpus
- clustering method
- natural language
- edit distance
- query terms
- data sets
- dimensionality reduction
- kNN
- pairwise
- graphical models