The Clustering-Based Initialization for Non-negative Matrix Factorization in the Feature Transformation of the High-Dimensional Text Categorization System: A Viewpoint of Term Vectors.
Le Nguyen Hoai NamHo Bao QuocPublished in: TPDL (2017)
Keyphrases
- text categorization
- negative matrix factorization
- high dimensional
- principal component analysis
- document clustering
- feature selection
- matrix factorization
- low dimensional
- sparse representation
- knn
- dimensionality reduction
- text classification
- text documents
- feature space
- k nearest neighbor
- principal components
- feature vectors
- vector space
- document representation
- nearest neighbor
- semi supervised learning
- high dimensional data
- k means
- input data
- feature extraction
- spectral clustering
- information retrieval
- input space
- kernel function
- data points
- similarity search
- neural network
- knowledge discovery
- collaborative filtering