Linear Bellman Completeness Suffices for Efficient Online Reinforcement Learning with Few Actions.
Noah Golowich
Ankur Moitra
Published in:
COLT (2024)
Keyphrases
reinforcement learning
real time
online learning
supervised learning
action selection
state action
learning algorithm
search engine
learning process
online communities
reward function
reinforcement learning methods
partially observable domains