Reinforcement learning with distance-based incentive/penalty (DIP) updates for highly constrained industrial control systems.

Published in: CoRR (2020)
