Scalable k-NN based text clustering.
Alessandro LulliThibault DebattyMatteo Dell'AmicoPietro MichiardiLaura RicciPublished in: IEEE BigData (2015)
Keyphrases
- knn
- text clustering
- text categorization
- k nearest neighbor
- text classification
- nearest neighbor
- k nearest neighbour
- text mining
- hierarchical clustering
- document clustering
- text documents
- text data
- distance function
- text collections
- similarity search
- k means
- background knowledge
- clustering algorithm
- feature selection
- information gain
- metric learning
- neural network
- wordnet
- support vector machine
- data sets
- vector space model
- multi label
- bag of words
- user feedback
- data structure
- self organizing maps
- learning algorithm
- semi supervised