An Efficient Data Extracting Method Based on Hadoop.
Lianchao Cao, Zhanqiang Li, Kaiyuan Qi, Guomao Xin, Dong Zhang
Published in: CloudComp (2014)
Keyphrases
- synthetic data
- input data
- data processing
- data sets
- statistical methods
- noisy data
- image data
- significant improvement
- detection method
- test data
- prior knowledge
- cost function
- high quality
- user input
- preprocessing
- missing data
- raw data
- knowledge discovery
- data analysis
- database
- data sources
- training data
- data collection
- correlation analysis
- data distribution
- prior information
- big data
- original data
- massive scale
- segmentation method
- data management
- feature set
- data points
- classification accuracy
- pairwise
- data structure
- objective function
- similarity measure
- data mining
- neural network