MiCS-P: Parallel mutual-information computation of big categorical data on Spark.
Junli Li, Chaowei Zhang, Jifu Zhang, Xiao Qin, Lihua Hu
Published in: J. Parallel Distributed Comput. (2022)
Keyphrases
- categorical data
- mutual information
- cluster analysis
- numerical data
- parameter free
- parallel computation
- numeric data
- information theoretic
- image registration
- feature selection
- hierarchical latent class models
- similarity measure
- information gain
- attribute values
- density based clustering
- big data
- numerical attributes
- image analysis
- object recognition
- hierarchical clustering algorithm
- clustering algorithm
- correspondence analysis
- distance based outlier detection
- information retrieval