DISC: Data-Intensive Similarity Measure for Categorical Data.
Aditya DesaiHimanshu SinghVikram PudiPublished in: PAKDD (2) (2011)
Keyphrases
- data intensive
- categorical data
- similarity measure
- hierarchical clustering algorithm
- cluster analysis
- data management
- parameter free
- numerical data
- data access
- big data
- web services
- clustering method
- earth science
- attribute values
- pairwise
- hierarchical latent class models
- decision making
- clustering algorithm
- grid computing
- database systems
- data mining
- distance based outlier detection
- data warehouse
- association rules
- data sets
- database