Rich Document Representation for Document Clustering.
Azam JalaliFarhad OroumchianPublished in: RIAO (2004)
Keyphrases
- document representation
- document clustering
- vector space model
- text documents
- text mining
- clustering algorithm
- latent semantic indexing
- document collections
- document categorization
- clustering method
- bag of words
- query expansion
- k means
- active learning
- prior knowledge
- background knowledge
- data fusion
- named entities
- digital libraries
- high level
- search engine
- machine learning
- databases