A Reduction-Based Framework for Conservative Bandits and Reinforcement Learning.
Yunchang Yang
Tianhao Wu
Han Zhong
Evrard Garcelon
Matteo Pirotta
Alessandro Lazaric
Liwei Wang
Simon Shaolei Du
Published in:
ICLR (2022)
Keyphrases
reinforcement learning
main contribution
probabilistic model
neural network
function approximation
database
real time
machine learning
objective function
multi agent
feature space
theoretical framework
conceptual framework