MEReQ: Max-Ent Residual-Q Inverse RL for Sample-Efficient Alignment from Intervention.

Yuxin Chen, Chen Tang, Chenran Li, Ran Tian, Peter Stone, Masayoshi Tomizuka, Wei Zhan
Published in: CoRR (2024)
Keyphrases
  • real time
  • databases
  • information retrieval
  • reinforcement learning
  • multi agent
  • data structure
  • neural network
  • learning algorithm
  • search engine
  • case study
  • image alignment