Linear Bellman Completeness Suffices for Efficient Online Reinforcement Learning with Few Actions.

Noah Golowich Ankur Moitra

Published in: COLT (2024)

Keyphrases

reinforcement learning
real time
online learning
supervised learning
action selection
state action
learning algorithm
search engine
learning process
online communities
reward function
reinforcement learning methods
partially observable domains