Improving Clinical NLP Performance through Language Model-Generated Synthetic Clinical Data.
Shan ChenJack GallifantMarco GuevaraYanjun GaoMajid AfsharTimothy MillerDmitriy DligachDanielle S. BittermanPublished in: CoRR (2024)
Keyphrases
- clinical data
- language model
- language modeling
- patient data
- medical data
- clinical information
- n gram
- raw data
- natural language processing
- probabilistic model
- statistical analysis
- information retrieval
- query expansion
- medical knowledge
- retrieval model
- mixture model
- knowledge discovery
- clinical databases
- clinical data sets
- patient records
- ad hoc information retrieval
- question answering
- smoothing methods
- clinical decision making
- information extraction
- context sensitive
- electronic medical record
- domain experts
- real patient data
- acute myocardial infarction
- natural language
- test collection
- machine learning
- medical images
- medical information
- temporal abstractions
- translation model
- cancer patients
- medical records
- survival data
- data sources
- mr images