Detecting and Preventing Confused Labels in Crowdsourced Data.
Evgeny KrivosheevSiarhei BykauFabio CasatiSunil PrabhakarPublished in: Proc. VLDB Endow. (2020)
Keyphrases
- statistical analysis
- data analysis
- data sets
- low quality
- complex data
- data quality
- data distribution
- data sources
- synthetic data
- neural network
- historical data
- spatial data
- data collection
- input data
- query processing
- prior knowledge
- data structure
- high quality
- training data
- small number
- knowledge discovery
- data points
- end users
- missing data
- sensor data
- data objects
- learning algorithm