Policy-Based Reinforcement Learning for Assortative Matching in Human Behavior Modeling.
Ou Deng
Qun Jin
Published in:
CoRR (2022)
Keyphrases
human behavior
reinforcement learning
optimal policy
policy search
action selection
daily life
human subjects
Markov decision process
real time
ground truth
function approximation
state space
human computer interaction
model free
low level
multi agent
agent behavior
learning algorithm