Self-tuning techniques for large scale cluster analysis on textual data collections.
Evelina Di Corso, Tania Cerquitelli, Francesco Ventura
Published in: SAC (2017)
Keyphrases
- cluster analysis
- data collections
- data collection
- categorical data
- semi structured
- data analysis
- clustering algorithm
- clustering method
- fuzzy clustering
- data mining techniques
- factor analysis
- k means
- unsupervised learning
- cluster validity
- data mining
- real world
- fuzzy c means
- multimedia
- document collections
- hierarchical latent class models
- data sets
- xml data
- data sources
- natural language
- metadata
- multi dimensional
- semi supervised
- databases