Login / Signup
Sampling Efficient Deep Reinforcement Learning through Preference-Guided Stochastic Exploration.
Wenhui Huang
Cong Zhang
Jingda Wu
Xiangkun He
Jie Zhang
Chen Lv
Published in:
CoRR (2022)
Keyphrases
</>
reinforcement learning
monte carlo
optimal policy
learning automata
machine learning
hidden markov models
genetic algorithm
multi agent
mobile robot
computationally efficient
stochastic approximation
active exploration
direct policy search