Sign in

Policy Learning for Time-Bounded Reachability in Continuous-Time Markov Decision Processes via Doubly-Stochastic Gradient Ascent.

Ezio BartocciLuca BortolussiTomás BrázdilDimitrios MiliosGuido Sanguinetti
Published in: QEST (2016)
Keyphrases