Generating Synthetic Training Data for Supervised De-Identification of Electronic Health Records.
Claudia Alessandra Libbi, Jan Trienes, Dolf Trieschnigg, Christin Seifert
Published in: Future Internet (2021)
Keyphrases
- electronic health records
- training data
- supervised learning
- learning algorithm
- clinical data
- health data
- health information technology
- training set
- medical data
- clinical trials
- decision trees
- medical records
- real world
- health care
- health records
- semi-supervised
- data sets
- domain knowledge
- information overload
- feature selection
- prior knowledge
- digital libraries
- distributed environment