Multi-Armed Bandit for Link Configuration in Millimeter-Wave Networks: An Approach for Solving Sequential Decision-Making Problems.
Yi Zhang, Robert W. Heath Jr.
Published in: IEEE Veh. Technol. Mag. (2023)
Keyphrases
- sequential decision making problems
- reinforcement learning
- multi armed bandit
- decision theoretic planning
- millimeter wave
- markov decision problems
- markov decision processes
- dec pomdps
- linear programming
- partially observable markov decision processes
- probabilistic planning
- markov chain
- optimal policy
- imaging process
- radar images
- physical phenomena
- bayesian networks
- special case
- graphical models
- state space
- probability distribution
- dynamic programming