A Dimensionality Reduction Approach for Semantic Document Classification.
Oskar AhlgrenPekka MaloAnkur SinhaPekka J. KorhonenJyrki WalleniusPublished in: SPIM (2011)
Keyphrases
- document classification
- dimensionality reduction
- text classification
- text categorization
- web documents
- feature selection
- text mining
- classification algorithm
- low dimensional
- feature extraction
- text documents
- high dimensional
- natural language
- topic extraction
- principal component analysis
- principal components
- automatic document classification
- feature space
- databases
- data analysis
- pattern recognition
- multiscale