Constructing synthetic datasets with generative artificial intelligence to train large language models to classify acute renal failure from clinical notes.
Onkar Litake, Brian H. Park, Jeffrey L. Tully, Rodney A. Gabriel
Published in: J. Am. Medical Informatics Assoc. (2024)
Keyphrases
- language model
- synthetic datasets
- real life
- language modeling
- real dataset
- experimental study
- probabilistic model
- real world
- document retrieval
- n gram
- test collection
- synthetic data
- mixture model
- speech recognition
- generative model
- information retrieval
- context sensitive
- language modelling
- retrieval model
- statistical language models
- query expansion
- language modeling framework
- smoothing methods
- vector space model
- document ranking
- original data
- unsupervised learning
- pseudo relevance feedback
- relevance model
- natural language processing
- high dimensional
- bayesian networks
- machine learning