Randomized self-updating process for clustering large-scale data.
Shang-Ying Shiu, Yen-Shiu Chin, Szu-Han Lin, Ting-Li Chen
Published in: Stat. Comput. (2024)
Keyphrases
- spectral clustering
- data analysis
- data points
- data sets
- redundant data
- multidimensional data
- data objects
- raw data
- high dimensional data
- high quality
- database systems
- training data
- data distribution
- small number
- experimental data
- statistical analysis
- clustering algorithm
- original data
- data clustering
- data processing
- clustering analysis
- end users
- data mining tasks
- data mining
- cluster centers
- prior knowledge
- database
- data quality
- missing data
- image data
- knowledge discovery
- xml documents
- data sources