Dimension reduction based on centroids and least squares for efficient processing of text data.
Moongu JeonHaesun ParkJ. Ben RosenPublished in: SDM (2001)
Keyphrases
- text data
- dimension reduction
- efficient processing
- least squares
- high dimensional
- high dimensional data
- data points
- singular value decomposition
- low dimensional
- query processing
- range queries
- text mining
- text classification
- dimensionality reduction
- high dimensionality
- efficient implementation
- linear discriminant analysis
- principal component analysis
- feature space
- k means
- feature extraction
- clustering algorithm
- text documents
- nearest neighbor
- join algorithms
- cluster analysis
- multi dimensional
- structured data
- data sets
- document collections
- similarity search
- unsupervised learning
- database systems
- database
- data analysis
- data distribution
- web pages
- search engine
- reinforcement learning
- probabilistic model
- neural network
- optical flow
- databases
- feature vectors