Generating Faithful Synthetic Data with Large Language Models: A Case Study in Computational Social Science.
Veniamin VeselovskyManoel Horta RibeiroAkhil AroraMartin JosifoskiAshton AndersonRobert WestPublished in: CoRR (2023)
Keyphrases
- synthetic data
- language model
- social sciences
- language modeling
- n gram
- probabilistic model
- document retrieval
- computer science
- real image data
- statistical language models
- information retrieval
- retrieval model
- data sets
- query expansion
- digital government
- language modelling
- speech recognition
- social scientists
- real world
- relevance model
- digital archiving
- ad hoc information retrieval
- context sensitive
- query terms
- vector space model
- smoothing methods
- test collection
- artificial intelligence
- statistical language modeling
- language model for information retrieval
- pseudo relevance feedback
- retrieval effectiveness
- machine learning