Login / Signup
A Dictionary-based Oversampling Approach to Clinical Document Classification on Small and Imbalanced Dataset.
Mahdi Abdollahi
Xiaoying Gao
Yi Mei
Shameek Ghosh
Jinyan Li
Published in:
WI/IAT (2020)
Keyphrases
</>
document classification
imbalanced datasets
text classification
text categorization
class imbalance
classification algorithm
text mining
web documents
text documents
class distribution
active learning
cost sensitive learning
data sets
knowledge discovery
probabilistic model
sampling methods
training data