Efficient Model-Based Concave Utility Reinforcement Learning through Greedy Mirror Descent.

Published in: AISTATS (2024)

Keyphrases