Navigating Data Scarcity: Pretraining for Medical Utterance Classification.
Do June MinVerónica Pérez-RosasRada MihalceaPublished in: ClinicalNLP@ACL (2023)
Keyphrases
- data sets
- database
- data points
- data reduction
- synthetic data
- data collection
- feature vectors
- high quality
- decision trees
- statistical analysis
- data analysis
- data processing
- image data
- pattern recognition
- raw data
- data mining techniques
- knowledge discovery
- original data
- end users
- data sources
- data mining
- support vector
- evidence based medicine
- medical information
- missing values
- data distribution
- health care
- missing data
- information systems
- high dimensional data
- feature selection
- classification accuracy
- feature extraction
- preprocessing
- data structure