Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness.
Shuaichen ChangJun WangMingwen DongLin PanHenghui ZhuAlexander Hanbo LiWuwei LanSheng ZhangJiarong JiangJoseph LilienSteve AshWilliam Yang WangZhiguo WangVittorio CastelliPatrick NgBing XiangPublished in: ICLR (2023)