Balanced Q-learning: Combining the influence of optimistic and pessimistic targets.
Thommen George Karimpanal
Hung Le
Majid Abdolshah
Santu Rana
Sunil Gupta
Truyen Tran
Svetha Venkatesh
Published in:
Artif. Intell. (2023)
Keyphrases
reinforcement learning
cooperative
multi agent
function approximation
learning algorithm
multi agent reinforcement learning
factors influencing
least squares
website
image sequences
search space
state space
information systems
optimal policy
machine learning
target detection
data sets
target recognition
database