Distance Based Strategy for Supervised Document Image Classification.
Fabien CarmagnacPierre HérouxÉric TrupinPublished in: SSPR/SPR (2004)
Keyphrases
- image classification
- codebook generation
- bag of words
- information retrieval
- feature extraction
- retrieval systems
- document images
- image representation
- document collections
- machine learning
- semi supervised
- supervised classification
- information retrieval systems
- unsupervised learning
- image features
- sparse representation
- web documents
- visual features
- learning algorithm
- distance measure
- outlier detection
- relevant documents
- document retrieval
- sparse coding
- tf idf
- natural language processing
- class specific
- multiscale
- multi label
- document classification
- document analysis
- document content