Constrained Contrastive Reinforcement Learning.

Haoyu Wang, Xinrui Yang, Yuhang Wang, Xuguang Lan

Published in: ACML (2022)

Keyphrases

reinforcement learning
model-free
function approximation
machine learning
multi-agent reinforcement learning
reinforcement learning algorithms
Markov decision processes
optimal policy
robotic control
policy search
learning agents
state space
multi-agent
data mining
similarity measure
case study
optimal control
website
action selection
temporal difference
social networks
learning algorithm
temporal difference learning
stochastic approximation
genetic algorithm
database