Login / Signup
Contextual Bandits and Imitation Learning via Preference-Based Active Queries.
Ayush Sekhari
Karthik Sridharan
Wen Sun
Runzhe Wu
Published in:
CoRR (2023)
Keyphrases
</>
imitation learning
query language
database
reinforcement learning
query processing
robotic systems
query evaluation
data objects
humanoid robot
xml documents
multi modal
maximum margin