Classification of documents by form and content.
Gerd MaderlechnerPeter SudaThomas BrücknerPublished in: Pattern Recognit. Lett. (1997)
Keyphrases
- document classification
- web documents
- textual content
- metadata
- pre classified
- classification accuracy
- decision trees
- pattern recognition
- feature extraction
- automatic classification
- semantic information
- support vector
- classification method
- multimedia documents
- document content
- text documents
- information retrieval
- feature selection
- classification algorithm
- machine learning
- classify documents
- supervised learning
- text classifiers
- textual information
- information retrieval systems
- image classification
- digital objects
- web content
- text classification
- xml documents
- effective retrieval
- document structure
- content and structure
- relevant documents
- support vector machine svm
- class labels