Validity problems in clinical machine learning by indirect data labeling using consensus definitions.
Michael HagmannShigehiko SchamoniStefan RiezlerPublished in: CoRR (2023)
Keyphrases
- machine learning
- data sets
- training data
- data analysis
- data points
- data collection
- raw data
- application domains
- knowledge discovery
- database
- knowledge acquisition
- big data
- data quality
- background knowledge
- missing data
- synthetic data
- data sources
- active learning
- data structure
- high quality
- decision trees
- data mining
- image data
- computer systems
- unsupervised learning
- end users
- domain experts
- statistical methods
- data objects
- data mining applications
- computational biology