Setting the Trap: Capturing and Defeating Backdoors in Pretrained Language Models through Honeypots.

Published in: NeurIPS (2023)

Keyphrases