Utilizing supervised models to infer consensus labels and their quality from data with multiple annotators.
Hui Wen Goh, Ulyana Tkachenko, Jonas Mueller
Published in: CoRR (2022)
Keyphrases
- high quality
- data quality
- data sets
- low quality
- experimental data
- training data
- historical data
- data analysis
- database
- data processing
- data sources
- data collection
- high dimensional data
- multiple sources
- machine learning
- prior knowledge
- missing data
- synthetic data
- raw data
- image data
- probability distribution
- xml documents