Recovering from Out-of-sample States via Inverse Dynamics in Offline Reinforcement Learning.
Ke JiangJia-Yu YaoXiaoyang TanPublished in: NeurIPS (2023)
Keyphrases
- inverse dynamics
- reinforcement learning
- nonlinear systems
- perceptual aliasing
- parallel manipulator
- function approximation
- transition model
- state space
- optimal policy
- adaptive control
- state variables
- machine learning
- model free
- markov decision processes
- real time
- learning algorithm
- image sequences
- knowledge base
- artificial intelligence