Towards Better Opioid Antagonists Using Deep Reinforcement Learning.

Jianyuan Deng Zhibo Yang Yao Li Dimitris Samaras Fusheng Wang

Published in: CoRR (2020)

Keyphrases

reinforcement learning
function approximation
model free
robotic control
reinforcement learning algorithms
optimal control
learning algorithm
temporal difference
state space
multi agent
neural network
learning process
markov decision processes
learning problems
objective function
multiscale
markov chain
learning capabilities
partially observable
deep learning
learning agents
multi agent reinforcement learning
active exploration
dynamic programming