Clustering mixed-type data using a probabilistic distance algorithm.
Cristina Tortora, Francesco Palumbo
Published in: Appl. Soft Comput. (2022)
Keyphrases
- k-means
- input data
- data sets
- clustering method
- clustering analysis
- data reduction
- data points
- spectral clustering
- data sources
- NP-hard
- synthetic datasets
- data analysis
- optimal solution
- distance matrix
- high dimensional data
- cluster centers
- noisy data
- uncertain data
- data clustering
- detection algorithm
- similarity matrix
- cluster structure
- hierarchical clustering algorithm
- probabilistic model
- clustering algorithm
- categorical data
- minimum distance
- segmentation algorithm
- dynamic programming
- Hamming distance
- clustering result
- information theoretic
- large scale data sets
- dissimilarity matrix
- dissimilarity measure
- similarity function
- hierarchical clustering
- cluster analysis
- distance metric
- missing data
- distance function
- expectation maximization
- probability distribution
- data structure
- training data
- learning algorithm