Parameterized Indexed Value Function for Efficient Exploration in Reinforcement Learning.

Tian Tan, Zhihan Xiong, Vikranth R. Dwaracherla

Published in: AAAI (2020)

Keyphrases

reinforcement learning
function approximators
state space
optimal policy
artificial intelligence
neural network
search engine
information systems
dynamic programming