A Similarity Rough Set Model for Document Representation and Document Clustering.
Nguyen Chi ThanhKoichi YamadaMuneyuki UneharaPublished in: J. Adv. Comput. Intell. Intell. Informatics (2011)
Keyphrases
- document representation
- document clustering
- document similarity
- vector space model
- document collections
- text mining
- text documents
- clustering algorithm
- similarity measure
- clustering method
- bag of words
- k means
- language model
- machine learning
- real world
- domain knowledge
- multiscale
- web documents
- semantic information
- semantic similarity