Login / Signup
State-Wise Adaptive Discounting from Experience (SADE): A Novel Discounting Scheme for Reinforcement Learning (Student Abstract).
Milan Zinzuvadiya
Vahid Behzadan
Published in:
AAAI (2021)
Keyphrases
</>
reinforcement learning
state space
learning process
student learning
machine learning
student model
normalized maximum likelihood
multi agent
high level
optimal policy
model free
knowledge level