Login / Signup

Improving Multilingual Instruction Finetuning via Linguistically Natural and Diverse Datasets.

Sathish Reddy IndurthiWenxuan ZhouShamil ChollampattRavi AgrawalKaiqiang SongLingxiao ZhaoChenguang Zhu
Published in: CoRR (2024)
Keyphrases
  • real world
  • benchmark datasets
  • database
  • digital libraries
  • cross lingual
  • raw data
  • case study
  • statistically significant
  • level parallelism