Study on High-Level Structure of cognition control construction in Exploration and Exploitation within Multi-Armed Bandit Model of Reinforcement Learning.

Jiaxing Tian, Jun Izawa
Published in: MHS (2023)
Keyphrases
  • high level
  • reinforcement learning
  • probabilistic model
  • parameter estimation
  • action selection
  • objective function
  • statistical models
  • model free