Hassan Mansoor
Publication Activity (10 Years)
Years Active: 2016-2024
Publications (10 Years): 9
Top Topics
User Interface
Model Based Reasoning
Reinforcement Learning
Error Minimization
Top Venues
CoRR
ACL (Findings)
IWSM-Mensura
Publications
Gladys Tyen, Hassan Mansoor, Victor Carbune, Peter Chen, Tony Mak
LLMs cannot find reasoning errors, but can correct them given the error location.
ACL (Findings)
(2024)
Hakim Sidahmed, Samrat Phatale, Alex Hutcheson, Zhuonan Lin, Zhang Chen, Zac Yu, Jarvis Jin, Roman Komarytsia, Christiane Ahlheim, Yonghao Zhu, Simral Chaudhary, Bowen Li, Saravanan Ganesh, Bill Byrne, Jessica Hoffmann, Hassan Mansoor, Wei Li, Abhinav Rastogi, Lucas Dixon
PERL: Parameter Efficient Reinforcement Learning from Human Feedback.
CoRR
(2024)
Tautvydas Misiunas, Hassan Mansoor, Jasper Uijlings, Oriana Riva, Victor Carbune
VQA Training Sets are Self-play Environments for Generating Few-shot Pools.
CoRR
(2024)
Victor Carbune, Hassan Mansoor, Fangyu Liu, Rahul Aralikatte, Gilles Baechler, Jindong Chen, Abhanshu Sharma
Chart-based Reasoning: Transferring Capabilities from LLMs to VLMs.
CoRR
(2024)
Gilles Baechler, Srinivas Sunkara, Maria Wang, Fedir Zubach, Hassan Mansoor, Vincent Etter, Victor Carbune, Jason Lin, Jindong Chen, Abhanshu Sharma
ScreenAI: A Vision-Language Model for UI and Infographics Understanding.
CoRR
(2024)
Gladys Tyen, Hassan Mansoor, Peter Chen, Tony Mak, Victor Carbune
LLMs cannot find reasoning errors, but can correct them!
CoRR
(2023)
Sian Gooding, Hassan Mansoor
The Impact of Preference Agreement in Reinforcement Learning from Human Feedback: A Case Study in Summarization.
CoRR
(2023)
Harrison Lee, Samrat Phatale, Hassan Mansoor, Kellie Lu, Thomas Mesnard, Colton Bishop, Victor Carbune, Abhinav Rastogi
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback.
CoRR
(2023)
Hassan Mansoor, Miroslaw Ochodek
Towards Semi-Automatic Size Measurement of User Interfaces in Web Applications with IFPUG SNAP.
IWSM-Mensura
(2016)