Reinforcement Learning in Education: A Multi-armed Bandit Approach.
Herkulaas MvE Combrink
Vukosi Marivate
Benjamin Rosman
Published in:
AFRICATEK (2022)
Keyphrases
multi armed bandit
reinforcement learning
multi armed bandits
state space
decentralized decision making
e learning
optimal policy
markov decision processes
temporal difference
learning problems
model free
multi agent
learning process
machine learning
distance measure
mutual information
learning algorithm