On verifying the authenticity of e-commercial crawling data by a semi-crosschecking method.
Tran Khanh Dang, Duc Minh Chau Pham, Duc Dan Ho
Published in: Int. J. Web Inf. Syst. (2019)
Keyphrases
- synthetic data
- input data
- noisy data
- data analysis
- statistical methods
- data collection
- image data
- data sets
- correlation analysis
- prior information
- clustering method
- training samples
- high accuracy
- dynamic programming
- database
- high quality
- significant improvement
- prior knowledge
- computational complexity
- preprocessing
- computational cost
- user input
- data quality
- test data
- data distribution
- objective function
- pairwise
- data points
- data sources
- model selection
- detection method
- high dimensional data
- segmentation method
- data processing
- data mining techniques
- support vector machine
- similarity measure
- spectral clustering
- decision trees
- cost function
- information loss