A Keyword-Enhanced Approach to Handle Class Imbalance in Clinical Text Classification.
Andrew E. BlanchardShang GaoHong-Jun YoonJames Blair ChristianEric B. DurbinXiao-Cheng WuAntoinette StroupJennifer A. DohertyStephen M. SchwartzCharles WigginsLinda CoyleLynne PenberthyGeorgia D. TourassiPublished in: IEEE J. Biomed. Health Informatics (2022)
Keyphrases
- class imbalance
- text classification
- feature selection
- class distribution
- cost sensitive
- active learning
- naive bayes
- text categorization
- sampling methods
- cost sensitive learning
- imbalanced datasets
- unlabeled data
- machine learning
- high dimensionality
- text mining
- labeled data
- minority class
- data cleaning
- multi label
- imbalanced data
- concept drift
- data mining
- k nearest neighbor
- classification accuracy
- imbalanced class distribution
- change detection
- unsupervised learning