SpreadCluster: recovering versioned spreadsheets through similarity-based clustering.
Liang XuWensheng DouChushu GaoJie WangJun WeiHua ZhongTao HuangPublished in: MSR (2017)
Keyphrases
- clustering algorithm
- clustering method
- k means
- data clustering
- document clustering
- cluster analysis
- information theoretic
- databases
- spectral clustering
- self organizing maps
- information retrieval
- data points
- unsupervised clustering
- graph theoretic
- data mining tasks
- categorical data
- learning algorithm
- fuzzy clustering
- data objects
- relational data
- hierarchical clustering
- database
- neural network
- real world
- website
- high dimensional data
- data model
- unsupervised learning