A Reward Shaping Approach for Reserve Price Optimization using Deep Reinforcement Learning.
Reza Refaei Afshar
Jason Rhuggenaath
Yingqian Zhang
Uzay Kaymak
Published in:
IJCNN (2021)
Keyphrases
reward shaping
reinforcement learning
reinforcement learning algorithms
complex domains
state space
optimal policy
temporal difference
neural network
learning algorithm
Markov decision processes
Markov decision process
mobile robot
Markov decision problems