Learning Good State and Action Representations for Markov Decision Process via Tensor Decomposition.
Chengzhuo Ni
Yaqi Duan
Munther Dahleh
Mengdi Wang
Anru R. Zhang
Published in:
J. Mach. Learn. Res. (2023)
Keyphrases
Markov decision process
state space
reinforcement learning
state action
supervised learning
learning algorithm
auxiliary information
transition probabilities
infinite horizon