Login / Signup
Wasserstein Actor-Critic: Directed Exploration via Optimism for Continuous-Actions Control.
Amarildo Likmeta
Matteo Sacco
Alberto Maria Metelli
Marcello Restelli
Published in:
CoRR (2023)
Keyphrases
</>
action selection
actor critic
optimal control
temporal difference
reinforcement learning
action space
control system
mobile robot
policy gradient
reinforcement learning algorithms
function approximation
multiple agents
neural network
control strategy
evaluation function
monte carlo
machine learning