Learning adaptive control in dynamic environments using reproducing kernel priors with bayesian policy gradients.

Apan DastiderSayyed Jaffar Ali RazaMingjie Lin
Published in: SAC (2022)