CACTO-SL: Using Sobolev learning to improve continuous actor-critic with trajectory optimization.

Published in: L4DC (2024)

Keyphrases