Policy-Based Reinforcement Learning for Assortative Matching in Human Behavior Modeling.

Ou Deng Qun Jin

Published in: CoRR (2022)

Keyphrases

human behavior
reinforcement learning
optimal policy
policy search
action selection
daily life
human subjects
markov decision process
real time
ground truth
function approximation
state space
human computer interaction
model free
low level
multi agent
agent behavior
learning algorithm