Login / Signup
Nandan Thakur
ORCID
Publication Activity (10 Years)
Years Active: 2020-2024
Publications (10 Years): 25
Top Topics
Retrieval Functions
Reference Models
Information Retrieval
Average Precision
Top Venues
CoRR
SIGIR
NAACL-HLT
TREC
</>
Publications
</>
Nandan Thakur
,
Luiz Bonifacio
,
Maik Fröbe
,
Alexander Bondarenko
,
Ehsan Kamalloo
,
Martin Potthast
,
Matthias Hagen
,
Jimmy Lin
Systematic Evaluation of Neural Retrieval Models on the Touché 2020 Argument Retrieval Subset of BEIR.
SIGIR
(2024)
Nandan Thakur
,
Jianmo Ni
,
Gustavo Hernández Ábrego
,
John Wieting
,
Jimmy Lin
,
Daniel Cer
Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval.
NAACL-HLT
(2024)
Ehsan Kamalloo
,
Nandan Thakur
,
Carlos Lassance
,
Xueguang Ma
,
Jheng-Hong Yang
,
Jimmy Lin
Resources for Brewing BEIR: Reproducible Reference Models and Statistical Analyses.
SIGIR
(2024)
Ronak Pradeep
,
Nandan Thakur
,
Sahel Sharifymoghaddam
,
Eric Zhang
,
Ryan Nguyen
,
Daniel Campos
,
Nick Craswell
,
Jimmy Lin
Ragnarök: A Reusable RAG Framework and Baselines for TREC 2024 Retrieval-Augmented Generation Track.
CoRR
(2024)
Nandan Thakur
,
Luiz Bonifacio
,
Maik Fröbe
,
Alexander Bondarenko
,
Ehsan Kamalloo
,
Martin Potthast
,
Matthias Hagen
,
Jimmy Lin
Systematic Evaluation of Neural Retrieval Models on the Touché 2020 Argument Retrieval Subset of BEIR.
CoRR
(2024)
Shivani Upadhyay
,
Ronak Pradeep
,
Nandan Thakur
,
Nick Craswell
,
Jimmy Lin
UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance Assessor.
CoRR
(2024)
Nandan Thakur
,
Kexin Wang
,
Iryna Gurevych
,
Jimmy Lin
SPRINT: A Unified Toolkit for Evaluating and Demystifying Zero-shot Neural Sparse Retrieval.
SIGIR
(2023)
Ehsan Kamalloo
,
Xinyu Zhang
,
Odunayo Ogundepo
,
Nandan Thakur
,
David Alfonso-Hermelo
,
Mehdi Rezagholizadeh
,
Jimmy Lin
Evaluating Embedding APIs for Information Retrieval.
ACL (industry)
(2023)
Ehsan Kamalloo
,
Nandan Thakur
,
Carlos Lassance
,
Xueguang Ma
,
Jheng-Hong Yang
,
Jimmy Lin
Resources for Brewing BEIR: Reproducible Reference Models and an Official Leaderboard.
CoRR
(2023)
Jimmy Lin
,
David Alfonso-Hermelo
,
Vitor Jeronymo
,
Ehsan Kamalloo
,
Carlos Lassance
,
Rodrigo Frassetto Nogueira
,
Odunayo Ogundepo
,
Mehdi Rezagholizadeh
,
Nandan Thakur
,
Jheng-Hong Yang
,
Xinyu Zhang
Simple Yet Effective Neural Ranking and Reranking Baselines for Cross-Lingual Information Retrieval.
CoRR
(2023)
Xinyu Zhang
,
Nandan Thakur
,
Odunayo Ogundepo
,
Ehsan Kamalloo
,
David Alfonso-Hermelo
,
Xiaoguang Li
,
Qun Liu
,
Mehdi Rezagholizadeh
,
Jimmy Lin
MIRACL: A Multilingual Retrieval Dataset Covering 18 Diverse Languages.
Trans. Assoc. Comput. Linguistics
11 (2023)
Ehsan Kamalloo
,
Xinyu Zhang
,
Odunayo Ogundepo
,
Nandan Thakur
,
David Alfonso-Hermelo
,
Mehdi Rezagholizadeh
,
Jimmy Lin
Evaluating Embedding APIs for Information Retrieval.
CoRR
(2023)
Nandan Thakur
,
Jianmo Ni
,
Gustavo Hernández Ábrego
,
John Wieting
,
Jimmy Lin
,
Daniel Cer
Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval.
CoRR
(2023)
Nandan Thakur
,
Kexin Wang
,
Iryna Gurevych
,
Jimmy Lin
SPRINT: A Unified Toolkit for Evaluating and Demystifying Zero-shot Neural Sparse Retrieval.
CoRR
(2023)
Nandan Thakur
,
Luiz Bonifacio
,
Xinyu Zhang
,
Odunayo Ogundepo
,
Ehsan Kamalloo
,
David Alfonso-Hermelo
,
Xiaoguang Li
,
Qun Liu
,
Boxing Chen
,
Mehdi Rezagholizadeh
,
Jimmy Lin
NoMIRACL: Knowing When You Don't Know for Robust Multilingual Retrieval-Augmented Generation.
CoRR
(2023)
Ehsan Kamalloo
,
Aref Jafari
,
Xinyu Zhang
,
Nandan Thakur
,
Jimmy Lin
HAGRID: A Human-LLM Collaborative Dataset for Generative Information-Seeking with Attribution.
CoRR
(2023)
Jimmy Lin
,
David Alfonso-Hermelo
,
Vitor Jeronymo
,
Ehsan Kamalloo
,
Carlos Lassance
,
Rodrigo Frassetto Nogueira
,
Odunayo Ogundepo
,
Mehdi Rezagholizadeh
,
Nandan Thakur
,
Jheng-Hong Yang
,
Xinyu Zhang
Simple Yet Effective Neural Ranking and Reranking Baselines for Cross-Lingual Information Retrieval.
TREC
(2022)
Xinyu Zhang
,
Nandan Thakur
,
Odunayo Ogundepo
,
Ehsan Kamalloo
,
David Alfonso-Hermelo
,
Xiaoguang Li
,
Qun Liu
,
Mehdi Rezagholizadeh
,
Jimmy Lin
Making a MIRACL: Multilingual Information Retrieval Across a Continuum of Languages.
CoRR
(2022)
Nandan Thakur
,
Nils Reimers
,
Jimmy Lin
Domain Adaptation for Memory-Efficient Dense Retrieval.
CoRR
(2022)
Kexin Wang
,
Nandan Thakur
,
Nils Reimers
,
Iryna Gurevych
GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval.
NAACL-HLT
(2022)
Nandan Thakur
,
Nils Reimers
,
Andreas Rücklé
,
Abhishek Srivastava
,
Iryna Gurevych
BEIR: A Heterogeneous Benchmark for Zero-shot Evaluation of Information Retrieval Models.
NeurIPS Datasets and Benchmarks
(2021)
Nandan Thakur
,
Nils Reimers
,
Andreas Rücklé
,
Abhishek Srivastava
,
Iryna Gurevych
BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models.
CoRR
(2021)
Kexin Wang
,
Nandan Thakur
,
Nils Reimers
,
Iryna Gurevych
GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval.
CoRR
(2021)
Nandan Thakur
,
Nils Reimers
,
Johannes Daxenberger
,
Iryna Gurevych
Augmented SBERT: Data Augmentation Method for Improving Bi-Encoders for Pairwise Sentence Scoring Tasks.
NAACL-HLT
(2021)
Nandan Thakur
,
Nils Reimers
,
Johannes Daxenberger
,
Iryna Gurevych
Augmented SBERT: Data Augmentation Method for Improving Bi-Encoders for Pairwise Sentence Scoring Tasks.
CoRR
(2020)