Distance-based Probabilistic Data Augmentation for Synthetic Minority Oversampling.
Joel GoodmanShahram SarkaniThomas A. MazzuchiPublished in: Trans. Data Sci. (2021)
Keyphrases
- data sources
- data processing
- training data
- data sets
- raw data
- data collection
- small number
- complex data
- data points
- data structure
- input data
- original data
- knowledge discovery
- database
- data analysis
- high dimensional data
- experimental data
- training examples
- statistical methods
- missing values
- test data
- data distribution
- missing data
- synthetic data
- generative model
- computer systems
- machine learning
- learning algorithm
- image data
- end users
- probabilistic model