Deconfounding Actor-Critic Network with Policy Adaptation for Dynamic Treatment Regimes.
Changchang YinRuoqi LiuJeffrey M. CaterinoPing ZhangPublished in: KDD (2022)
Keyphrases
- actor critic
- policy gradient
- reinforcement learning
- optimal control
- neuro fuzzy
- function approximation
- neural network
- dynamic environments
- reinforcement learning algorithms
- gradient method
- cost function
- optimal policy
- sufficient conditions
- temporal difference
- state space
- natural actor critic
- policy gradient methods