Prompting the data transformation activities for cluster analysis on collections of documents.
Tania CerquitelliEvelina Di CorsoFrancesco VenturaSilvia ChiusanoPublished in: SEBD (2017)
Keyphrases
- cluster analysis
- data transformation
- document collections
- data mining
- document clustering
- information retrieval
- data integration
- categorical data
- privacy preserving
- clustering algorithm
- data quality
- information retrieval systems
- clustering method
- privacy preserving data mining
- data mining techniques
- metadata
- data warehousing
- text collections
- data analysis
- xml documents
- unsupervised learning
- functional dependencies
- k means
- dimension reduction
- dimensionality reduction
- hierarchical latent class models
- cluster validity
- vector space model
- text documents
- data mining algorithms
- digital libraries
- privacy preservation
- data management
- association rules
- random projections
- data sets
- database
- knowledge discovery
- active learning
- relational databases
- machine learning