Evaluating LLMs at Detecting Errors in LLM Responses.
Ryo Kamoi, Sarkar Snigdha Sarathi Das, Renze Lou, Jihyun Janice Ahn, Yilun Zhao, Xiaoxin Lu, Nan Zhang, Yusen Zhang, Ranran Haoran Zhang, Sujeeth Reddy Vummanthala, Salika Dave, Shaobo Qin, Arman Cohan, Wenpeng Yin, Rui Zhang
Published in: CoRR (2024)