Automatic Deduction Path Learning via Reinforcement Learning with Environmental Correction.
Shuai Xiao
Chen Pan
Min Wang
Xinxin Zhu
Siqiao Xue
Jing Wang
Yunhua Hu
James Zhang
Jinghua Feng
Published in:
CoRR (2023)
Keyphrases
reinforcement learning
learning process
learning algorithm
learning problems
supervised learning
autonomous learning
knowledge acquisition
optimal policy
learned knowledge
multi agent
prior knowledge
semi supervised
learning experience
mobile learning
stochastic games
evolutionary learning