Nigel Tao
Publication Activity (10 Years)
Years Active: 2001-2013
Publications (10 Years): 0
Top Topics
Total Reward
Reinforcement Learning
Eligibility Traces
Learning Agent
Top Venues
CoRR
Publications
Lex Weaver, Nigel Tao. The Optimal Reward Baseline for Gradient-Based Reinforcement Learning. CoRR (2013)
Nigel Tao, Jonathan Baxter, Lex Weaver. A Multi-Agent Policy-Gradient Approach to Network Routing. ICML (2001)
Lex Weaver, Nigel Tao. The Optimal Reward Baseline for Gradient-Based Reinforcement Learning. UAI (2001)