Automatic identification of cited text spans: a multi-classifier approach over imbalanced dataset.
Shutian Ma, Jin Xu, Chengzhi Zhang
Published in: Scientometrics (2018)
Keyphrases
- automatic identification
- imbalanced datasets
- cost sensitive learning
- feature space
- decision trees
- training data
- learning from imbalanced data
- training dataset
- class distribution
- text mining
- class labels
- classification algorithm
- ensemble learning
- feature set
- support vector machine
- machine learning
- ensemble methods
- training examples
- svm classifier
- cost sensitive
- class imbalance
- decision boundary
- knn
- training set