Login / Signup
Learning-Based Mean-Payoff Optimization in an Unknown MDP under Omega-Regular Constraints.
Jan Kretínský
Guillermo A. Pérez
Jean-François Raskin
Published in:
CONCUR (2018)
Keyphrases
</>
learning process
optimization algorithm
learning algorithm
reinforcement learning
learning systems
constrained optimization
neural network
long term
knowledge acquisition
unsupervised learning
learning tasks
constraint programming