Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning.
Yinglun Xu
Qi Zeng
Gagandeep Singh
Published in:
Trans. Mach. Learn. Res. (2023)
Keyphrases
reinforcement learning
balancing exploration and exploitation
function approximation
multi-agent
state space
online learning
optimal policy
countermeasures
model-free
temporal difference