Login / Signup
DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning.
Utsav Singh
Souradip Chakraborty
Wesley A. Suttle
Brian M. Sadler
Vinay P. Namboodiri
Amrit Singh Bedi
Published in:
CoRR (2024)
Keyphrases
</>
hierarchical reinforcement learning
reinforcement learning
machine learning
hidden markov models