Imitating Cost-Constrained Behaviors in Reinforcement Learning.
Qian ShaoPradeep VarakanthamShih-Fen ChengPublished in: CoRR (2024)
Keyphrases
- reinforcement learning
- function approximation
- total cost
- multi agent
- reinforcement learning algorithms
- cost reduction
- real robot
- learning process
- temporal difference
- markov decision processes
- optimal control
- stochastic approximation
- real time
- expected cost
- model free
- optimal policy
- np hard
- query processing
- case study
- data mining
- neural network