Increasing Diversity While Maintaining Accuracy: Text Data Generation with Large Language Models and Human Interventions.
John Joon Young ChungEce KamarSaleema AmershiPublished in: CoRR (2023)
Keyphrases
- language model
- data generation
- information retrieval
- language modeling
- n gram
- document retrieval
- probabilistic model
- language modelling
- document level
- query expansion
- speech recognition
- retrieval model
- test collection
- vector space model
- text mining
- smoothing methods
- error rate
- keywords
- document ranking
- search engine
- relevance model
- text retrieval
- classification accuracy
- data streams
- text documents
- anomaly detection
- okapi bm