Login / Signup
Provably More Efficient Q-Learning in the Full-Feedback/One-Sided-Feedback Settings.
Xiao-Yue Gong
David Simchi-Levi
Published in:
CoRR (2020)
Keyphrases
</>
reinforcement learning
learning algorithm
cooperative
relevance feedback
data mining
case study
multi agent
data structure
real time
machine learning
decision trees
search space
computationally efficient
user feedback
feedback information
multi agent reinforcement learning