From Controlled Document Authoring to Interactive Document Normalization.
Aurélien MaxPublished in: COLING (2004)
Keyphrases
- document clustering
- retrieval systems
- document images
- information retrieval
- document classification
- document retrieval
- information retrieval systems
- keywords
- document collections
- web documents
- preprocessing
- tf idf
- document processing
- document content
- digital documents
- cf loadingtexthtml
- relational databases
- digital libraries
- data structure
- ranked list
- data mining
- document representation
- normalization method