Imbalanced-learn: A Python Toolbox to Tackle the Curse of Imbalanced Datasets in Machine Learning.
Guillaume LemaitreFernando NogueiraChristos K. AridasPublished in: CoRR (2016)
Keyphrases
- imbalanced datasets
- machine learning
- cost sensitive learning
- learning from imbalanced data
- decision trees
- class imbalance
- class distribution
- ensemble methods
- imbalanced data
- cost sensitive
- sampling methods
- rare class
- training dataset
- minority class
- imbalanced class distribution
- highly skewed
- machine learning methods
- active learning
- missing values
- high dimensionality
- support vector machine
- high dimensional
- learning algorithm
- dimensionality reduction
- feature selection
- machine learning algorithms
- text classification
- feature selection algorithms
- misclassification costs
- data mining
- probability estimation
- base classifiers
- learning tasks
- transfer learning
- test set
- linear regression
- high dimensional data
- training samples
- training data