​
Login / Signup
Hugh Zhang
Publication Activity (10 Years)
Years Active: 2019-2024
Publications (10 Years): 13
Top Topics
Aggregated Search
Dialog Systems
Natural Language
Lightweight
Top Venues
CoRR
AAMAS
NAACL-HLT (1)
AAAI
</>
Publications
</>
Kenneth Li
,
Samy Jelassi
,
Hugh Zhang
,
Sham M. Kakade
,
Martin Wattenberg
,
David Brandfonbrener
Q-Probe: A Lightweight Approach to Reward Maximization for Language Models.
CoRR
(2024)
Hugh Zhang
,
Jeff Da
,
Dean Lee
,
Vaughn Robinson
,
Catherine Wu
,
Will Song
,
Tiffany Zhao
,
Pranav Raja
,
Dylan Slack
,
Qin Lyu
,
Sean Hendryx
,
Russell Kaplan
,
Michele Lunati
,
Summer Yue
A Careful Examination of Large Language Model Performance on Grade School Arithmetic.
CoRR
(2024)
Luca D'Amico-Wong
,
Hugh Zhang
,
Marc Lanctot
,
David C. Parkes
Easy as ABCs: Unifying Boltzmann Q-Learning and Counterfactual Regret Minimization.
CoRR
(2024)
Vaskar Nath
,
Dylan Slack
,
Jeff Da
,
Yuntao Ma
,
Hugh Zhang
,
Spencer Whitehead
,
Sean Hendryx
Learning Goal-Conditioned Representations for Language Reward Models.
CoRR
(2024)
Huaixiu Steven Zheng
,
Swaroop Mishra
,
Hugh Zhang
,
Xinyun Chen
,
Minmin Chen
,
Azade Nova
,
Le Hou
,
Heng-Tze Cheng
,
Quoc V. Le
,
Ed H. Chi
,
Denny Zhou
NATURAL PLAN: Benchmarking LLMs on Natural Language Planning.
CoRR
(2024)
Hugh Zhang
No-regret Learning Dynamics for Sequential Correlated Equilibria.
AAMAS
(2023)
Hugh Zhang
,
David C. Parkes
Chain-of-Thought Reasoning is a Policy Improvement Operator.
CoRR
(2023)
Hugh Zhang
,
Adam Lerer
,
Noam Brown
Equilibrium Finding in Normal-Form Games Via Greedy Regret Minimization.
CoRR
(2022)
Hugh Zhang
A Simple Adaptive Procedure Converging to Forgiving Correlated Equilibria.
CoRR
(2022)
Hugh Zhang
,
Adam Lerer
,
Noam Brown
Equilibrium Finding in Normal-Form Games via Greedy Regret Minimization.
AAAI
(2022)
Hugh Zhang
,
Daniel Duckworth
,
Daphne Ippolito
,
Arvind Neelakantan
Trading Off Diversity and Quality in Natural Language Generation.
CoRR
(2020)
Tatsunori B. Hashimoto
,
Hugh Zhang
,
Percy Liang
Unifying Human and Statistical Evaluation for Natural Language Generation.
NAACL-HLT (1)
(2019)
Tatsunori B. Hashimoto
,
Hugh Zhang
,
Percy Liang
Unifying Human and Statistical Evaluation for Natural Language Generation.
CoRR
(2019)