Micro-Armed Bandit: Lightweight & Reusable Reinforcement Learning for Microarchitecture Decision-Making.
Gerasimos GerogiannisJosep TorrellasPublished in: MICRO (2023)
Keyphrases
- lightweight
- reinforcement learning
- decision making
- decentralized decision making
- action selection
- multi agent
- multi armed bandit
- decision support system
- function approximation
- early stage
- reinforcement learning algorithms
- markov decision processes
- development environments
- decision process
- decision makers
- supply chain
- machine learning
- data mining
- software systems
- model free
- handheld devices
- learning algorithm
- temporal difference
- state space
- dos attacks
- dynamic programming