Login / Signup
Richard Ren
Publication Activity (10 Years)
Years Active: 2023-2024
Publications (10 Years): 4
Top Topics
Case Based Reasoning
Feature Representation
Macro Actions
Partially Observable
Top Venues
CoRR
IEEE Robotics Autom. Lett.
</>
Publications
</>
Richard Ren
,
Steven Basart
,
Adam Khoja
,
Alice Gatti
,
Long Phan
,
Xuwang Yin
,
Mantas Mazeika
,
Alexander Pan
,
Gabriel Mukobi
,
Ryan H. Kim
,
Stephen Fitz
,
Dan Hendrycks
Safetywashing: Do AI Safety Benchmarks Actually Measure Safety Progress?
CoRR
(2024)
Aaron Hao Tan
,
Federico Pizarro Bejarano
,
Yuhan Zhu
,
Richard Ren
,
Goldie Nejat
Deep Reinforcement Learning for Decentralized Multi-Robot Exploration With Macro Actions.
IEEE Robotics Autom. Lett.
8 (1) (2023)
Andy Zou
,
Long Phan
,
Sarah Chen
,
James Campbell
,
Phillip Guo
,
Richard Ren
,
Alexander Pan
,
Xuwang Yin
,
Mantas Mazeika
,
Ann-Kathrin Dombrowski
,
Shashwat Goel
,
Nathaniel Li
,
Michael J. Byun
,
Zifan Wang
,
Alex Mallen
,
Steven Basart
,
Sanmi Koyejo
,
Dawn Song
,
Matt Fredrikson
,
J. Zico Kolter
,
Dan Hendrycks
Representation Engineering: A Top-Down Approach to AI Transparency.
CoRR
(2023)
James Campbell
,
Richard Ren
,
Phillip Guo
Localizing Lying in Llama: Understanding Instructed Dishonesty on True-False Questions Through Prompting, Probing, and Patching.
CoRR
(2023)