A MinHash Approach for Clustering Large Collections of Binary Programs.
Ciprian OprisaPublished in: CSCS (2015)
Keyphrases
- clustering algorithm
- k means
- clustering method
- binary vectors
- information retrieval
- hierarchical clustering
- categorical data
- graph theoretic
- data clustering
- anomaly detection
- document collections
- cluster analysis
- fuzzy clustering
- data sets
- high dimensional
- digital libraries
- source code
- self organizing maps
- spectral clustering