Imitating Cost-Constrained Behaviors in Reinforcement Learning.

Qian Shao, Pradeep Varakantham, Shih-Fen Cheng

Published in: CoRR (2024)

Keyphrases

reinforcement learning
function approximation
total cost
multi-agent
reinforcement learning algorithms
cost reduction
real robot
learning process
temporal difference
Markov decision processes
optimal control
stochastic approximation
real time
expected cost
model-free
optimal policy
NP-hard
query processing
case study
data mining
neural network