Cost-Sensitive BERT for Generalisable Sentence Classification with Imbalanced Data.
Harish Tayyar Madabushi, Elena Kochkina, Michael Castelle
Published in: CoRR (2020)
Keyphrases
- cost sensitive
- imbalanced data
- class imbalance
- class distribution
- cost sensitive classification
- cost sensitive learning
- misclassification costs
- multi class
- support vector machine
- binary classification
- minority class
- naive bayes
- cost sensitive boosting
- active learning
- base learners
- sampling methods
- base classifiers
- classification models
- svm classifier
- feature vectors
- machine learning
- data sets
- classification algorithm
- decision boundary
- text classification
- decision trees
- feature selection
- random forest
- class labels
- classification accuracy