Average reward adjusted deep reinforcement learning for order release planning in manufacturing.
Manuel Schneckenreither
Stefan Haeussler
Juanjo Peiró
Published in:
Knowl. Based Syst. (2022)
Keyphrases
reinforcement learning
average reward
optimal policy
Markov decision processes
machine learning
multi-agent systems
state space
search algorithm
supervised learning
linear programming
heuristic search