Performance improvement of reinforcement learning algorithms for online 3D bin packing using FPGA.
Kavya Borra, Ashwin Krishnan, Harshad Khadilkar, Manoj Nambiar, Ansuma Basumatary, Rekha Singhal, Arijit Mukherjee
Published in: AIMLSystems (2022)
Keyphrases
- bin packing
- reinforcement learning algorithms
- reinforcement learning
- state space
- Markov decision processes
- model free
- learning algorithm
- reinforcement learning problems
- reinforcement learning methods
- temporal difference
- eligibility traces
- function approximation
- search tree
- packing problem
- dynamic environments
- graph colouring
- machine learning
- multi agent
- evaluation function
- function approximators
- optimal policy
- multi dimensional