DCDistance: A Supervised Text Document Feature extraction based on class labels.
Charles Henrique Porto Ferreira, Debora Maria Rossi de Medeiros, Fabrício Olivetti de França
Published in: CoRR (2018)
Keyphrases
- class labels
- text documents
- feature extraction
- supervised learning
- feature selection
- feature set
- text classification
- label information
- text mining
- text categorization
- labeled data
- document classification
- classification algorithm
- keywords
- unsupervised learning
- semi supervised
- multi label
- information extraction
- training data
- topic models
- feature vectors
- unlabeled data
- document clustering
- learning algorithm
- bag of words
- wordnet
- machine learning
- image processing
- image classification
- training examples
- training set
- named entities
- document representation
- active learning
- feature space
- natural language processing
- semi supervised learning
- principal component analysis
- databases
- image segmentation
- face recognition
- support vector
- knn
- dimensionality reduction