Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics.
Swabha SwayamdiptaRoy SchwartzNicholas LourieYizhong WangHannaneh HajishirziNoah A. SmithYejin ChoiPublished in: CoRR (2020)
Keyphrases
- training dataset
- benchmark datasets
- text classification tasks
- synthetic datasets
- uci datasets
- ground truth labels
- training set
- massive datasets
- high dimensional datasets
- restricted boltzmann machine
- pascal voc
- computer graphics
- recurrent networks
- standard learning algorithms
- training data
- linear svm
- dynamic model
- dynamical systems
- training phase
- active learning
- class imbalanced data
- million images
- database
- training process
- supervised learning
- data sets
- uci machine learning repository
- experimental study
- high dimensional
- machine learning
- real world