Survival Instinct in Offline Reinforcement Learning.
Anqi Li, Dipendra Misra, Andrey Kolobov, Ching-An Cheng
Published in: NeurIPS (2023)
Keyphrases
- reinforcement learning
- function approximation
- machine learning
- real time
- reinforcement learning algorithms
- state space
- Markov decision processes
- database
- learning algorithm
- artificial intelligence
- temporal difference
- learning process
- optimal control
- optimal policy
- control problems
- model free
- multi agent
- genetic algorithm
- databases
- control system
- active learning
- search space
- function approximators
- survival analysis
- reinforcement learning methods
- stochastic approximation
- continuous state
- evolutionary learning
- direct policy search