Policy learning in continuous-time Markov decision processes using Gaussian Processes.

Ezio Bartocci, Luca Bortolussi, Tomás Brázdil, Dimitrios Milios, Guido Sanguinetti
Published in: Perform. Evaluation (2017)
Keyphrases