Login / Signup
In-context Exploration-Exploitation for Reinforcement Learning.
Zhenwen Dai
Federico Tomasi
Sina Ghiassian
Published in:
CoRR (2024)
Keyphrases
</>
reinforcement learning
exploration exploitation
active learning
function approximation
contextual information
learning algorithm
state space
optimal solution
data sets
feature extraction
semi supervised learning
optimal policy
bandit problems