Sign in

MADLAD-400: A Multilingual And Document-Level Large Audited Dataset.

Sneha KuduguntaIsaac CaswellBiao ZhangXavier GarciaChristopher A. Choquette-ChooKatherine LeeDerrick XinAditya KusupatiRomi StellaAnkur BapnaOrhan Firat
Published in: CoRR (2023)
Keyphrases