Generating stable molecules using imitation and reinforcement learning.
Søren Ager MeldgaardJonas KöhlerHenrik Lund MortensenMads-Peter V. ChristiansenFrank NoéBjørk HammerPublished in: Mach. Learn. Sci. Technol. (2022)
Keyphrases
- reinforcement learning
- function approximation
- markov decision processes
- machine learning
- reinforcement learning algorithms
- state space
- temporal difference
- optimal policy
- multi agent reinforcement learning
- action selection
- model free
- policy search
- case study
- imitation learning
- temporal difference learning
- learning classifier systems
- learning algorithm
- transfer learning
- website
- data sets
- computational models
- learning problems
- generation process
- control problems
- markov decision process
- supervised learning