Towards Solving the Winograd Schema Challenge: Model-Free, Model-Based and a Spectrum in Between.

Weinan He Zhanhao Xiao

Published in: KSEM (2021)

Keyphrases

model free
reinforcement learning
function approximation
reinforcement learning algorithms
policy iteration
temporal difference
databases
data model
rl algorithms
policy evaluation
impedance control
data sets
feature extraction
markov decision problems