Combining Statistics and Semantics for Word and Document Clustering.
Alexandre TermierMichèle SebagMarie-Christine RoussetPublished in: Workshop on Ontology Learning (2001)
Keyphrases
- document clustering
- text mining
- document representation
- topic extraction
- clustering algorithm
- negative matrix factorization
- k means
- document collections
- clustering method
- co occurrence
- document similarity
- vector space model
- text documents
- tf idf
- tolerance rough set
- data mining
- n gram
- cluster analysis
- document clusters
- artificial intelligence
- pairwise constraints
- related documents
- information retrieval systems
- ant based clustering
- databases