Automatic Feature Selection with Applications to Script Identification of Degraded Documents.
Vitaly Ablavsky, Mark R. Stevens
Published in: ICDAR (2003)
Keyphrases
- document collections
- xml documents
- information retrieval
- document classification
- document retrieval
- keywords
- metadata
- web documents
- text documents
- website
- information retrieval systems
- relevant documents
- plagiarism detection
- vector space model
- digital documents
- document analysis
- text analysis
- document representation
- database
- search engine
- web data
- retrieval effectiveness
- query terms
- latent semantic analysis
- vector space
- electronic documents
- text mining