Mimicking Human Process: Text Representation via Latent Semantic Clustering for Classification.
Xiaoye TanRui YanChongyang TaoMingrui WuPublished in: CoRR (2019)
Keyphrases
- text representation
- latent semantic
- text classification
- unsupervised learning
- clustering algorithm
- document clustering
- feature selection
- image classification
- k means
- machine learning
- active learning
- information extraction
- collaborative filtering
- language model
- feature vectors
- high dimensional
- object recognition
- document collections
- clustering method
- cluster analysis
- feature extraction
- latent semantic analysis
- knowledge base