Multi-Stage Reinforcement Learning for Non-Prehensile Manipulation.
Dexin WangChunsheng LiuFaliang ChangHengqiang HuanKun ChengPublished in: IEEE Robotics Autom. Lett. (2024)
Keyphrases
- multistage
- reinforcement learning
- optimal policy
- dynamic programming
- single stage
- production system
- state space
- stochastic programming
- function approximation
- lot sizing
- markov decision processes
- temporal difference
- stochastic optimization
- machine learning
- model free
- finite state
- reinforcement learning algorithms
- finite horizon
- multi agent
- average cost
- linear program
- markov decision process
- assembly systems