Login / Signup
Solving multi-model MDPs by coordinate ascent and dynamic programming.
Xihong Su
Marek Petrik
Published in:
UAI (2023)
Keyphrases
</>
dynamic programming
feature extraction
reinforcement learning
markov decision processes
face recognition
objective function
input data