KMEANS Algorithm Clustering for Massive AIS Data Based on the Spark Platform.
Xiumin ChuJinyu LeiXinglong LiuZhiyuan WangPublished in: CRC (2020)
Keyphrases
- k means
- clustering analysis
- cluster centers
- input data
- clustering result
- data clustering
- data analysis
- clustering method
- data points
- clustering algorithm
- spectral clustering
- np hard
- preprocessing
- data sets
- learning algorithm
- dissimilarity matrix
- data reduction
- optimal solution
- dynamic programming
- similarity matrix
- expectation maximization
- training data
- objective function
- categorical data
- data structure
- data sources
- similarity function
- spectral methods
- similarity measure
- synthetic datasets
- information loss
- data objects
- computational complexity
- data distribution
- detection algorithm
- worst case
- noisy data
- original data
- high dimensional
- clustering quality
- search space
- simulated annealing
- large scale data sets
- rows and columns
- database