Applying an existing machine learning algorithm to text categorization.
Isabelle MoulinierJean-Gabriel GanasciaPublished in: Learning for Natural Language Processing (1995)
Keyphrases
- text categorization
- learning algorithm
- unlabeled data
- feature selection
- text classification
- knn
- multi label
- information gain
- k nearest neighbor
- machine learning
- reuters corpus
- automated text categorization
- naive bayes
- training data
- document categorization
- feature weighting
- automatic text categorization
- text classifiers
- semantic browsing
- text documents
- semi supervised learning
- tf idf
- text collections
- learning process
- data sets
- classification algorithm
- generalization error
- active learning
- term frequency
- labeled data
- supervised learning
- reinforcement learning
- data mining