Logically-Correct Reinforcement Learning.
Mohammadhosein Hasanbeig, Alessandro Abate, Daniel Kroening
Published in: CoRR (2018)
Keyphrases
- reinforcement learning
- function approximation
- Markov decision processes
- temporal difference
- reinforcement learning algorithms
- model free
- optimal policy
- robotic control
- continuous state
- machine learning
- state space
- genetic algorithm
- Markov chain
- least squares
- learning process
- expert systems
- social networks
- artificial intelligence
- learning capabilities
- stochastic approximation
- learning algorithm
- multi agent reinforcement learning