Avoiding Bias in Text Clustering Using Constrained K-means and May-Not-Links.
M. Eduardo AresJavier ParaparAlvaro BarreiroPublished in: ICTIR (2009)
Keyphrases
- text clustering
- k means
- clustering algorithm
- text mining
- document clustering
- hierarchical clustering
- self organizing maps
- clustering method
- clustering quality
- variable weighting
- cluster ensemble
- text categorization
- text data
- metric learning
- background knowledge
- cluster analysis
- data clustering
- text classification
- wordnet
- text collections
- fuzzy c means
- text documents
- document representation
- user feedback
- semantic relations
- latent semantic analysis
- expectation maximization
- spectral clustering
- data analysis
- information retrieval