An FPGA-Based On-Device Reinforcement Learning Approach using Online Sequential Learning.
Hirohisa Watanabe, Mineto Tsukada, Hiroki Matsutani
Published in: CoRR (2020)
Keyphrases
- reinforcement learning
- learning process
- learning algorithm
- online learning
- active learning
- supervised learning
- eligibility traces
- real time
- learning systems
- learning problems
- online training
- passive aggressive
- active exploration
- learning capabilities
- hybrid learning
- unsupervised learning
- dynamic programming
- prior knowledge