How Good is Good Enough?: Quantifying the Effects of Training Set Quality.
Benjamin SwanMelanie LaverdiereHsiuhan Lexie YangPublished in: GeoAI@SIGSPATIAL (2018)
Keyphrases
- training set
- high quality
- active learning
- data sets
- low quality
- cross validation
- supervised learning
- nearest neighbor
- clustering algorithm
- training data
- classification accuracy
- support vector machine
- software development
- test set
- test data
- artificial intelligence
- training samples
- error rate
- databases
- data quality
- database
- higher quality