MoleGuLAR: Molecule Generation Using Reinforcement Learning with Alternating Rewards.
Manan GoelShampa RaghunathanSiddhartha LaghuvarapuU. Deva PriyakumarPublished in: J. Chem. Inf. Model. (2021)
Keyphrases
- reinforcement learning
- function approximation
- markov decision processes
- state space
- temporal difference
- reward shaping
- model free
- multi agent
- learning problems
- reinforcement learning algorithms
- optimal policy
- dynamic programming
- function approximators
- learning algorithm
- action space
- data sets
- robotic control
- transfer learning
- machine learning
- real world
- multi agent systems
- temporal difference learning
- state and action spaces