Automatic Unit Test Data Generation and Actor-Critic Reinforcement Learning for Code Synthesis.
Philip John GorinskiMatthieu ZimmerGerasimos LampourasDerrick-Goh-Xin DeikIgnacio IacobacciPublished in: CoRR (2023)
Keyphrases
- actor critic
- reinforcement learning
- test data generation
- temporal difference
- policy gradient
- approximate dynamic programming
- optimal control
- reinforcement learning algorithms
- neuro fuzzy
- function approximation
- test cases
- simulated annealing algorithm
- policy iteration
- average reward
- gradient method
- state space
- markov decision processes
- optimal policy
- supervised learning
- policy gradient methods
- long run
- model free
- learning algorithm
- learning problems
- generation algorithm
- reinforcement learning methods
- rl algorithms
- source code