Login / Signup
Greedy Policy Search: A Simple Baseline for Learnable Test-Time Augmentation.
Alexander Lyzhov
Yuliya Molchanova
Arsenii Ashukha
Dmitry Molchanov
Dmitry P. Vetrov
Published in:
UAI (2020)
Keyphrases
</>
policy search
reinforcement learning
dynamic programming
machine learning
neural network
multi agent
step size
continuous state