Triple-D: Denoising Distant Supervision for High-Quality Data Creation.
Xinyi Zhu, Yongqi Zhang, Lei Chen, Kai Chen
Published in: ICDE (2024)
Keyphrases
- high quality
- data sets
- denoising
- data collection
- data analysis
- complex data
- synthetic data
- data processing
- training data
- data quality
- data structure
- end users
- data points
- image data
- light source
- machine learning
- raw data
- data acquisition
- original data
- data distribution
- small number
- high-dimensional data
- statistical analysis
- data mining techniques
- semi-supervised
- prior knowledge
- multiscale
- decision trees
- search engine
- data mining