Using IR Techniques for Text Classification in Document Analysis.
Rainer Hoch
Published in: SIGIR (1994)
Keyphrases
- document analysis
- text classification
- information retrieval
- text mining
- document images
- character recognition
- text analysis
- image analysis
- document image analysis
- bag of words
- text categorization
- machine learning
- n gram
- text data
- labeled data
- document processing
- knn
- text documents
- feature selection
- query expansion
- word segmentation
- information retrieval systems
- relevance feedback
- electronic documents
- text retrieval
- document image retrieval
- printed documents
- language independent
- k nearest neighbor
- semantic features
- association rules
- pattern recognition
- metadata
- video analysis
- handwritten documents
- language modeling
- document retrieval