Hardware implementation of the upper confidence-bound algorithm for reinforcement learning.
Nevena RadovicMilena Zogovic ErcegPublished in: Comput. Electr. Eng. (2021)
Keyphrases
- hardware implementation
- reinforcement learning
- image processing algorithms
- learning algorithm
- pipeline architecture
- dynamic programming
- upper confidence bound
- signal processing
- efficient implementation
- contextual bandit
- software implementation
- fpga implementation
- real time
- video sequences
- dedicated hardware
- image binarization