Login / Signup
Towards Solving the Winograd Schema Challenge: Model-Free, Model-Based and a Spectrum in Between.
Weinan He
Zhanhao Xiao
Published in:
KSEM (2021)
Keyphrases
</>
model free
reinforcement learning
function approximation
reinforcement learning algorithms
policy iteration
temporal difference
databases
data model
rl algorithms
policy evaluation
impedance control
data sets
feature extraction
markov decision problems