Login / Signup
Safetywashing: Do AI Safety Benchmarks Actually Measure Safety Progress?
Richard Ren
Steven Basart
Adam Khoja
Alice Gatti
Long Phan
Xuwang Yin
Mantas Mazeika
Alexander Pan
Gabriel Mukobi
Ryan H. Kim
Stephen Fitz
Dan Hendrycks
Published in:
CoRR (2024)
Keyphrases
</>
artificial intelligence
similarity measure
case based reasoning
coal mining
real time
databases
machine learning
united states
lecture notes in artificial intelligence
nuclear power plant
safety analysis