An efficient approach for dimensionality reduction and classification of high dimensional text documents.
Kotte VinayKumarR. SrinivasanElijah Blessing SinghPublished in: DATA (2018)
Keyphrases
- text documents
- high dimensional
- text mining
- text data
- text classification
- information extraction
- low dimensional
- text categorization
- keywords
- topic models
- named entities
- wordnet
- news articles
- document classification
- dimensionality reduction
- bag of words
- document clustering
- multiscale
- nearest neighbor
- co occurrence
- data points
- artificial intelligence
- high dimensional data
- automatic text categorization
- information extraction systems
- machine learning
- document collections
- image classification
- natural language processing
- image features
- active learning
- pairwise
- similarity measure
- learning algorithm
- information retrieval