Joint learning of reward machines and policies in environments with partially known semantics.

Published in: Artif. Intell. (2024)
