Gradient-Descent for Randomized Controllers Under Partial Observability.

Linus Heck Jip Spel Sebastian Junges Joshua Moerman Joost-Pieter Katoen

Published in: VMCAI (2022)

Keyphrases

partial observability
reinforcement learning
planning problems
partially observable
belief state
state space
belief space
objective function
control system
planning under partial observability
learning rules
partially observable markov decision processes
learning algorithm
planning domains
markov decision process
control strategy
domain specific