Ensemble vs. Data Sampling: Which Option Is Best Suited to Improve Classification Performance of Imbalanced Bioinformatics Data?
Taghi M. KhoshgoftaarAlireza FazelpourDavid J. DittmanAmri NapolitanoPublished in: ICTAI (2015)
Keyphrases
- data sets
- data analysis
- neural network
- machine learning
- raw data
- missing data
- data structure
- data sources
- data points
- data collection
- image data
- data processing
- data mining techniques
- database
- knowledge discovery
- high dimensional data
- training data
- classification algorithm
- statistical methods
- high dimensionality
- feature selection
- original data
- rare events