Recovery RL: Safe Reinforcement Learning with Learned Recovery Zones.
Brijen ThananjeyanAshwin BalakrishnaSuraj NairMichael LuoKrishnan SrinivasanMinho HwangJoseph E. GonzalezJulian IbarzChelsea FinnKen GoldbergPublished in: CoRR (2020)
Keyphrases
- reinforcement learning
- function approximation
- recovery algorithm
- model free
- action selection
- image recovery
- state space
- optimal policy
- reinforcement learning algorithms
- dynamic programming
- autonomous learning
- temporal difference
- learning classifier systems
- supervised learning
- learned knowledge
- continuous state
- failure recovery