MEReQ: Max-Ent Residual-Q Inverse RL for Sample-Efficient Alignment from Intervention.
Yuxin Chen
Chen Tang
Chenran Li
Ran Tian
Peter Stone
Masayoshi Tomizuka
Wei Zhan
Published in:
CoRR (2024)
Keyphrases
real time
databases
information retrieval
reinforcement learning
multi agent
data structure
neural network
learning algorithm
search engine
case study
image alignment