An Investigation into the Effects of Pre-training Data Distributions for Pathology Report Classification.
Aliyah R. HsuYeshwanth CherapanamjeriBriton ParkTristan NaumannAnobel Y. OdishoBin YuPublished in: CoRR (2023)
Keyphrases
- data distribution
- supervised learning
- training set
- training phase
- training process
- data streams
- decision boundary
- training samples
- pattern recognition
- support vector
- classification algorithm
- feature extraction
- feature vectors
- data sets
- text classification
- feature space
- decision trees
- support vector machine
- training examples
- feature selection
- concept drift
- machine learning
- training dataset
- neural network
- semi supervised
- active learning
- index structure
- test set
- class labels
- database