Login / Signup

Average reward adjusted deep reinforcement learning for order release planning in manufacturing.

Manuel SchneckenreitherStefan HaeusslerJuanjo Peiró
Published in: Knowl. Based Syst. (2022)
Keyphrases
  • reinforcement learning
  • average reward
  • optimal policy
  • markov decision processes
  • machine learning
  • multi agent systems
  • state space
  • search algorithm
  • supervised learning
  • linear programming
  • heuristic search