A Consistent Hashing Based Data Redistribution Algorithm.
Xiang FuCan PengWeihong HanPublished in: IScIDE (2) (2015)
Keyphrases
- input data
- data sets
- noisy data
- learning algorithm
- data analysis
- cost function
- prior information
- worst case
- np hard
- k means
- globally optimal
- computational complexity
- optimization algorithm
- information loss
- optimal solution
- high dimensional data
- detection algorithm
- incomplete data
- tree structure
- database
- probability distribution
- data distribution
- data sources
- synthetic datasets
- search space
- storage space
- data collection
- dimensional data
- hashing algorithm
- original data
- labeled data
- expectation maximization
- probabilistic model
- computational cost
- high dimensional
- data structure
- bayesian networks