Survival Instinct in Offline Reinforcement Learning.

Anqi Li, Dipendra Misra, Andrey Kolobov, Ching-An Cheng

Published in: NeurIPS (2023)

Keyphrases

reinforcement learning
function approximation
machine learning
real time
reinforcement learning algorithms
state space
Markov decision processes
database
learning algorithm
artificial intelligence
temporal difference
learning process
optimal control
optimal policy
control problems
model free
multi agent
genetic algorithm
databases
control system
active learning
search space
function approximators
survival analysis
reinforcement learning methods
stochastic approximation
continuous state
evolutionary learning
direct policy search