INGENIOUS: Using Informative Data Subsets for Efficient Pre-Training of Language Models.
H. S. V. N. S. Kowndinya RenduchintalaKrishnateja KillamsettySumit BhatiaMilan AggarwalGanesh RamakrishnanRishabh K. IyerBalaji KrishnamurthyPublished in: EMNLP (Findings) (2023)