Building k-nn graphs from large text data.
Thibault DebattyPietro MichiardiOlivier ThonnardWim MeesPublished in: IEEE BigData (2014)
Keyphrases
- knn
- text data
- text classification
- k nearest neighbor
- graph construction
- nearest neighbor
- text categorization
- text documents
- text mining
- distance function
- high dimensional data
- structured data
- high dimensional
- similarity search
- k nearest neighbour
- document collections
- labeled data
- bag of words
- database
- k nearest
- feature selection
- feature extraction
- learning algorithm
- neural network
- voting methods
- document classification
- unlabeled data
- n gram
- index structure
- unsupervised learning
- information retrieval systems
- support vector machine
- probabilistic model
- data sets