Simran Khanuja
Publication Activity (10 Years)
Years Active: 2019-2023
Publications (10 Years): 21
Top Topics
Event Extraction
Multi Lingual
Language Model
Comparative Analysis
Top Venues
CoRR
EMNLP
CodeSwitch@LREC
ACL (Findings)
Publications
Anubha Kabra
,
Emmy Liu
,
Simran Khanuja
,
Alham Fikri Aji
,
Genta Indra Winata
,
Samuel Cahyawijaya
,
Aremu Anuoluwapo
,
Perez Ogayo
,
Graham Neubig
Multi-lingual and Multi-cultural Figurative Language Understanding.
ACL (Findings)
(2023)
Yueqi Song
,
Catherine Cui
,
Simran Khanuja
,
Pengfei Liu
,
Fahim Faisal
,
Alissa Ostapenko
,
Genta Indra Winata
,
Alham Fikri Aji
,
Samuel Cahyawijaya
,
Yulia Tsvetkov
,
Antonios Anastasopoulos
,
Graham Neubig
GlobalBench: A Benchmark for Global Progress in Natural Language Processing.
CoRR
(2023)
Simran Khanuja
,
Sebastian Ruder
,
Partha Talukdar
Evaluating the Diversity, Equity, and Inclusion of NLP Technology: A Case Study for Indian Languages.
EACL (Findings)
(2023)
Anubha Kabra
,
Emmy Liu
,
Simran Khanuja
,
Alham Fikri Aji
,
Genta Indra Winata
,
Samuel Cahyawijaya
,
Aremu Anuoluwapo
,
Perez Ogayo
,
Graham Neubig
Multi-lingual and Multi-cultural Figurative Language Understanding.
CoRR
(2023)
Yueqi Song
,
Simran Khanuja
,
Pengfei Liu
,
Fahim Faisal
,
Alissa Ostapenko
,
Genta Winata
,
Alham Fikri Aji
,
Samuel Cahyawijaya
,
Yulia Tsvetkov
,
Antonios Anastasopoulos
,
Graham Neubig
GlobalBench: A Benchmark for Global Progress in Natural Language Processing.
EMNLP
(2023)
Simran Khanuja
,
Srinivas Gowriraj
,
Lucio M. Dery
,
Graham Neubig
DeMuX: Data-efficient Multilingual Learning.
CoRR
(2023)
Alexis Conneau
,
Ankur Bapna
,
Yu Zhang
,
Min Ma
,
Patrick von Platen
,
Anton Lozhkov
,
Colin Cherry
,
Ye Jia
,
Clara Rivera
,
Mihir Kale
,
Daan van Esch
,
Vera Axelrod
,
Simran Khanuja
,
Jonathan H. Clark
,
Orhan Firat
,
Michael Auli
,
Sebastian Ruder
,
Jason Riesa
,
Melvin Johnson
XTREME-S: Evaluating Cross-lingual Speech Representations.
CoRR
(2022)
Alexis Conneau
,
Min Ma
,
Simran Khanuja
,
Yu Zhang
,
Vera Axelrod
,
Siddharth Dalmia
,
Jason Riesa
,
Clara Rivera
,
Ankur Bapna
FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech.
CoRR
(2022)
Simran Khanuja
,
Sebastian Ruder
,
Partha P. Talukdar
Evaluating Inclusivity, Equity, and Accessibility of NLP Technology: A Case Study for Indian Languages.
CoRR
(2022)
Alexis Conneau
,
Ankur Bapna
,
Yu Zhang
,
Min Ma
,
Patrick von Platen
,
Anton Lozhkov
,
Colin Cherry
,
Ye Jia
,
Clara Rivera
,
Mihir Kale
,
Daan van Esch
,
Vera Axelrod
,
Simran Khanuja
,
Jonathan H. Clark
,
Orhan Firat
,
Michael Auli
,
Sebastian Ruder
,
Jason Riesa
,
Melvin Johnson
XTREME-S: Evaluating Cross-lingual Speech Representations.
INTERSPEECH
(2022)
Ankur Bapna
,
Colin Cherry
,
Yu Zhang
,
Ye Jia
,
Melvin Johnson
,
Yong Cheng
,
Simran Khanuja
,
Jason Riesa
,
Alexis Conneau
mSLAM: Massively multilingual joint pre-training for speech and text.
CoRR
(2022)
Alexis Conneau
,
Min Ma
,
Simran Khanuja
,
Yu Zhang
,
Vera Axelrod
,
Siddharth Dalmia
,
Jason Riesa
,
Clara Rivera
,
Ankur Bapna
FLEURS: FEW-Shot Learning Evaluation of Universal Representations of Speech.
SLT
(2022)
Simran Khanuja
,
Melvin Johnson
,
Partha P. Talukdar
MergeDistill: Merging Language Models using Pre-trained Distillation.
ACL/IJCNLP (Findings)
(2021)
Simran Khanuja
,
Diksha Bansal
,
Sarvesh Mehtani
,
Savya Khosla
,
Atreyee Dey
,
Balaji Gopalan
,
Dilip Kumar Margam
,
Pooja Aggarwal
,
Rajiv Teja Nagipogu
,
Shachi Dave
,
Shruti Gupta
,
Subhash Chandra Bose Gali
,
Vish Subramanian
,
Partha P. Talukdar
MuRIL: Multilingual Representations for Indian Languages.
CoRR
(2021)
Simran Khanuja
,
Melvin Johnson
,
Partha P. Talukdar
MergeDistill: Merging Pre-trained Language Models using Distillation.
CoRR
(2021)
Simran Khanuja
,
Sandipan Dandapat
,
Sunayana Sitaram
,
Monojit Choudhury
A New Dataset for Natural Language Inference from Code-mixed Conversations.
CodeSwitch@LREC
(2020)
Simran Khanuja
,
Sandipan Dandapat
,
Anirudh Srinivasan
,
Sunayana Sitaram
,
Monojit Choudhury
GLUECoS: An Evaluation Benchmark for Code-Switched NLP.
CoRR
(2020)
Sanket Shah
,
Satarupa Guha
,
Simran Khanuja
,
Sunayana Sitaram
Cross-lingual and Multilingual Spoken Term Detection for Low-Resource Indian Languages.
CoRR
(2020)
Simran Khanuja
,
Sandipan Dandapat
,
Anirudh Srinivasan
,
Sunayana Sitaram
,
Monojit Choudhury
GLUECoS: An Evaluation Benchmark for Code-Switched NLP.
ACL
(2020)
Simran Khanuja
,
Sandipan Dandapat
,
Sunayana Sitaram
,
Monojit Choudhury
A New Dataset for Natural Language Inference from Code-mixed Conversations.
CoRR
(2020)
Pratik Joshi
,
Christian Barnes
,
Sebastin Santy
,
Simran Khanuja
,
Sanket Shah
,
Anirudh Srinivasan
,
Satwik Bhattamishra
,
Sunayana Sitaram
,
Monojit Choudhury
,
Kalika Bali
Unsung Challenges of Building and Deploying Language Technologies for Low Resource Language Communities.
CoRR
(2019)