TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching.

Published in: CoRR (2023)

Keyphrases