Login / Signup
Off-Policy Learning in Contextual Bandits for Remote Electrical Tilt Optimization.
Filippo Vannella
Jaeseong Jeong
Alexandre Proutière
Published in:
IEEE Trans. Veh. Technol. (2023)
Keyphrases
</>
learning algorithm
unsupervised learning
learning process
prior knowledge
knowledge acquisition
reinforcement learning
active learning
learning tasks
global optimization
real time
online learning
mobile learning
optimization method
learning scheme