A review on modeling tumor dynamics and agent reward functions in reinforcement learning based therapy optimization.
Márton György AlmásyAndrás HörömpoDániel KissGábor KertészPublished in: J. Intell. Fuzzy Syst. (2022)
Keyphrases
- reward function
- reinforcement learning
- reinforcement learning algorithms
- markov decision processes
- markov decision process
- multiple agents
- state space
- policy search
- partially observable
- optimal policy
- inverse reinforcement learning
- multi agent
- transition model
- state action
- learning agent
- initially unknown
- action selection
- transition probabilities
- state variables
- learning algorithm
- function approximation
- dynamical systems
- breast cancer
- generative model
- learning agents
- markov decision problems
- markov chain
- multi agent systems
- objective function