Survival Instinct in Offline Reinforcement Learning.
Anqi LiDipendra MisraAndrey KolobovChing-An ChengPublished in: CoRR (2023)
Keyphrases
- reinforcement learning
- function approximation
- markov decision processes
- reinforcement learning algorithms
- state space
- real time
- model free
- temporal difference
- decision making
- multi agent
- temporal difference learning
- direct policy search
- robotic control
- multi agent reinforcement learning
- learning agent
- mobile robot
- active learning
- information retrieval