Behavior policy learning: Learning multi-stage tasks via solution sketches and model-based controllers.
Konstantinos Tsinganos, Konstantinos I. Chatzilygeroudis, Denis Hadjivelichkov, Theodoros Komninos, Evangelos Dermatas, Dimitrios Kanoulas
Published in: Future Internet (2022)