PHICON: Improving Generalization of Clinical Text De-identification Models via Data Augmentation.
Xiang YueShuang ZhouPublished in: ClinicalNLP@EMNLP (2020)
Keyphrases
- database
- data collection
- data sets
- historical data
- synthetic data
- data analysis
- information retrieval
- accurate models
- image data
- text mining
- high dimensional data
- training data
- data quality
- statistical methods
- experimental data
- data processing
- prior knowledge
- statistical analysis
- computer systems
- databases
- raw data
- text retrieval
- data sources
- textual data
- data structure