Heuristic Classification of Office Documents.
Xiaolong HaoJason T. L. WangMichael P. BieberPeter A. NgPublished in: Int. J. Artif. Intell. Tools (1994)
Keyphrases
- document classification
- text classification
- classification algorithm
- pattern recognition
- feature vectors
- image classification
- classification accuracy
- classification method
- pattern classification
- automatic categorization
- decision tree learning algorithm
- benchmark datasets
- decision trees
- machine learning
- web documents
- document collections
- document retrieval
- learning algorithm
- automatic classification
- preprocessing
- search algorithm
- feature selection
- text classifiers
- document clustering
- automatic text classification
- relevant documents
- user queries
- text categorization
- information retrieval systems
- supervised learning
- support vector machine
- xml documents
- training set
- metadata