Logically-Correct Reinforcement Learning.

Mohammadhosein Hasanbeig, Alessandro Abate, Daniel Kroening

Published in: CoRR (2018)

Keyphrases

reinforcement learning
function approximation
Markov decision processes
temporal difference
reinforcement learning algorithms
model free
optimal policy
robotic control
continuous state
machine learning
state space
genetic algorithm
Markov chain
least squares
learning process
expert systems
social networks
artificial intelligence
learning capabilities
stochastic approximation
learning algorithm
multi agent reinforcement learning