Login / Signup
Fahim Tajwar
ORCID
Publication Activity (10 Years)
Years Active: 2021-2024
Publications (10 Years): 12
Top Topics
Reinforcement Learning
Reactive Behavior
Exploration Exploitation Tradeoff
Surgical Navigation
Top Venues
CoRR
ICLR
Proc. Natl. Acad. Sci. USA
Trans. Mach. Learn. Res.
</>
Publications
</>
Fahim Tajwar
,
Anikait Singh
,
Archit Sharma
,
Rafael Rafailov
,
Jeff Schneider
,
Tengyang Xie
,
Stefano Ermon
,
Chelsea Finn
,
Aviral Kumar
Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data.
CoRR
(2024)
Caroline Choi
,
Fahim Tajwar
,
Yoonho Lee
,
Huaxiu Yao
,
Ananya Kumar
,
Chelsea Finn
Conservative Prediction via Data-Driven Confidence Minimization.
Trans. Mach. Learn. Res.
2024 (2024)
Yoonho Lee
,
Annie S. Chen
,
Fahim Tajwar
,
Ananya Kumar
,
Huaxiu Yao
,
Percy Liang
,
Chelsea Finn
Surgical Fine-Tuning Improves Adaptation to Distribution Shifts.
ICLR
(2023)
Max Sobol Mark
,
Archit Sharma
,
Fahim Tajwar
,
Rafael Rafailov
,
Sergey Levine
,
Chelsea Finn
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias.
CoRR
(2023)
Caroline Choi
,
Fahim Tajwar
,
Yoonho Lee
,
Huaxiu Yao
,
Ananya Kumar
,
Chelsea Finn
Conservative Prediction via Data-Driven Confidence Minimization.
CoRR
(2023)
Allan Zhou
,
Fahim Tajwar
,
Alexander Robey
,
Tom Knowles
,
George J. Pappas
,
Hamed Hassani
,
Chelsea Finn
Do deep networks transfer invariances across classes?
ICLR
(2022)
Yoonho Lee
,
Annie S. Chen
,
Fahim Tajwar
,
Ananya Kumar
,
Huaxiu Yao
,
Percy Liang
,
Chelsea Finn
Surgical Fine-Tuning Improves Adaptation to Distribution Shifts.
CoRR
(2022)
Annie Xie
,
Fahim Tajwar
,
Archit Sharma
,
Chelsea Finn
When to Ask for Help: Proactive Interventions in Autonomous Reinforcement Learning.
NeurIPS
(2022)
Allan Zhou
,
Fahim Tajwar
,
Alexander Robey
,
Tom Knowles
,
George J. Pappas
,
Hamed Hassani
,
Chelsea Finn
Do Deep Networks Transfer Invariances Across Classes?
CoRR
(2022)
Annie Xie
,
Fahim Tajwar
,
Archit Sharma
,
Chelsea Finn
When to Ask for Help: Proactive Interventions in Autonomous Reinforcement Learning.
CoRR
(2022)
Jihyeon Janel Lee
,
Nina R. Brooks
,
Fahim Tajwar
,
Marshall Burke
,
Stefano Ermon
,
David B. Lobell
,
Debashish Biswas
,
Stephen P. Luby
Scalable deep learning to identify brick kilns and aid regulatory capacity.
Proc. Natl. Acad. Sci. USA
118 (17) (2021)
Fahim Tajwar
,
Ananya Kumar
,
Sang Michael Xie
,
Percy Liang
No True State-of-the-Art? OOD Detection Methods are Inconsistent across Datasets.
CoRR
(2021)