MixDir: Scalable Bayesian Clustering for High-Dimensional Categorical Data.
Constantin Ahlmann-EltzeChristopher YauPublished in: DSAA (2018)
Keyphrases
- categorical data
- parameter free
- high dimensional
- cluster analysis
- numerical data
- binary data
- numeric data
- categorical attributes
- low dimensional
- similarity search
- attribute values
- k means
- density based clustering
- variable selection
- multivariate time series
- information retrieval
- categorical values
- distance based outlier detection
- high dimensionality
- clustering method
- dimensionality reduction
- probability distribution
- bayesian inference
- outlier detection
- numerical attributes
- feature space
- data analysis
- correspondence analysis
- feature selection
- data sets