DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models.
Boxin WangWeixin ChenHengzhi PeiChulin XieMintong KangChenhui ZhangChejian XuZidi XiongRitik DuttaRylan SchaefferSang T. TruongSimran AroraMantas MazeikaDan HendrycksZinan LinYu ChengSanmi KoyejoDawn SongBo LiPublished in: CoRR (2023)