Parameterized Indexed Value Function for Efficient Exploration in Reinforcement Learning.
Tian Tan
Zhihan Xiong
Vikranth R. Dwaracherla
Published in:
AAAI (2020)
Keyphrases
reinforcement learning
function approximators
state space
optimal policy
artificial intelligence
neural network
search engine
information systems
dynamic programming