Tricking LLMs into Disobedience: Formalizing, Analyzing, and Detecting Jailbreaks.
Abhinav Rao
Atharva Naik
Sachin Vashistha
Somak Aditya
Monojit Choudhury
Published in:
LREC/COLING (2024)
Keyphrases
automatic detection
artificial intelligence
social networks
computer vision
decision making
web services