Login / Signup
Downstream Datasets Make Surprisingly Good Pretraining Corpora.
Kundan Krishna
Saurabh Garg
Jeffrey P. Bigham
Zachary C. Lipton
Published in:
ACL (1) (2023)
Keyphrases
</>
text data
natural language processing
benchmark datasets
data sets
database
expert systems
uci machine learning repository
uci repository
synthetic and real datasets
text corpora
raw data
search engine
artificial intelligence
learning algorithm
real world
databases
real time