Classification of Documents by Content.
Simon JailletMaguelonne TeisseireJacques ChauchéViolaine PrincePublished in: IEEE ICCI (2003)
Keyphrases
- document classification
- metadata
- pattern recognition
- information retrieval
- web documents
- textual content
- classification accuracy
- automatic categorization
- document content
- support vector
- pre classified
- textual features
- semantic content
- classification method
- decision trees
- feature selection
- multimedia documents
- structured documents
- automatic classification
- document retrieval
- multimedia
- xml documents
- document collections
- support vector machine svm
- image classification
- web search
- machine learning
- training set
- document categorization
- semantic relevance
- feature extraction
- feature space
- digital objects
- active learning
- document clustering
- text documents
- feature vectors
- information retrieval systems
- support vector machine