Refine the Corpora Based on Document Manifold.
Chengwei YaoYilin WangGencai ChenPublished in: ADMA (1) (2013)
Keyphrases
- document corpus
- manifold learning
- document images
- document collections
- word frequency
- information retrieval systems
- text corpus
- document clustering
- retrieval systems
- high dimensional
- information retrieval
- document classification
- text collections
- web documents
- natural language processing
- database
- tf idf
- document analysis
- topic segmentation
- document retrieval
- text documents
- related documents
- relevant documents
- relevance feedback
- machine learning