reBandit: Random Effects based Online RL algorithm for Reducing Cannabis Use.
Susobhan Ghosh
Yongyi Guo
Pei-Yao Hung
Lara N. Coughlin
Erin Bonar
Inbal Nahum-Shani
Maureen A. Walton
Susan A. Murphy
Published in:
CoRR (2024)
Keyphrases
dynamic programming
optimal solution
learning algorithm
expectation maximization
artificial neural networks
function approximation
model free