slimIPL: Language-Model-Free Iterative Pseudo-Labeling.
Tatiana Likhomanenko
Qiantong Xu
Jacob Kahn
Gabriel Synnaeve
Ronan Collobert
Published in:
CoRR (2020)
Keyphrases
model free
reinforcement learning
reinforcement learning algorithms
function approximation
policy iteration
programming language
temporal difference
neural network
impedance control
average reward
multi agent
unsupervised learning
image classification
dynamic programming
active learning
policy evaluation
data sets