Efficient mixture model for clustering of sparse high dimensional binary data.
Marek SmiejaKrzysztof HajtoJacek TaborPublished in: Data Min. Knowl. Discov. (2019)
Keyphrases
- binary data
- high dimensional
- mixture model
- categorical data
- gaussian mixture model
- probabilistic model
- high dimensional data
- generative model
- data points
- dimensionality reduction
- high dimensionality
- unsupervised learning
- density estimation
- low dimensional
- em algorithm
- mixture modeling
- expectation maximization
- model selection
- maximum likelihood
- language model
- formal concept analysis
- data sets
- model based clustering
- clustering method
- similarity measure
- variable selection
- clustering algorithm
- continuous data
- probabilistic mixture model
- bayesian information criterion
- sparse coding
- data structure
- information retrieval