Classification automatique de documents. La mesure des deux écarts.
Simon Jaillet, Jacques Chauché, Violaine Prince, Maguelonne Teisseire
Published in: INFORSID (2003)
Keyphrases
- document classification
- classification accuracy
- machine learning
- pattern recognition
- information retrieval
- classification models
- automatic classification
- document collections
- document categorization
- support vector machine
- automatic categorization
- decision trees
- benchmark datasets
- machine learning algorithms
- support vector machine svm
- xml documents
- text documents
- classification algorithm
- classify documents
- classification scheme
- web documents
- support vector
- classification systems
- text classification
- image classification
- information retrieval systems
- preprocessing
- supervised learning
- feature vectors
- document analysis
- pre classified
- test collection
- class labels
- text categorization
- model selection
- feature set
- probabilistic model
- keywords
- feature selection