Empirically Verifying Hypotheses Using Reinforcement Learning.
Kenneth Marino, Rob Fergus, Arthur Szlam, Abhinav Gupta
Published in: CoRR (2020)
Keyphrases
- reinforcement learning
- function approximation
- model checking
- temporal difference learning
- Markov decision processes
- reinforcement learning algorithms
- temporal difference
- state space
- policy search
- multi-agent
- optimal policy
- optimal control
- learning problems
- direct policy search
- model free
- website
- learning classifier systems
- machine learning
- transfer learning
- decision making