Efficient mixture model for clustering of sparse high dimensional binary data.
Marek SmiejaKrzysztof HajtoJacek TaborPublished in: CoRR (2017)
Keyphrases
- feature space
- high dimensional
- binary data
- mixture model
- gaussian mixture model
- high dimensionality
- dimensionality reduction
- data points
- low dimensional
- feature vectors
- em algorithm
- high dimensional data
- mixture modeling
- probabilistic model
- density estimation
- variable selection
- nearest neighbor
- generative model
- feature selection
- model selection
- expectation maximization
- unsupervised learning
- sparse coding
- feature extraction
- categorical data
- bayesian information criterion
- maximum likelihood
- formal concept analysis
- bayesian networks
- model based clustering
- language model
- cluster analysis
- k means
- continuous data
- dirichlet process
- semi supervised learning
- data sets
- probabilistic mixture model