Text Categorization for Assessing Multiple Documents Integration, or John Henry Visits a Data Mine.
Peter M. Hastings, Simon Hughes, Joseph Magliano, Susan Goldman, Kimberly A. Lawless
Published in: AIED (2011)
Keyphrases
- text categorization
- document classification
- data sets
- textual data
- text documents
- document categorization
- data analysis
- information gain
- automatic categorization
- text classification
- unlabeled data
- training data
- metadata
- feature selection
- training documents
- text classifiers
- automatic text categorization
- term frequency
- document collections
- semi supervised learning
- knn
- data points
- data mining