Login / Signup

Adaptation Method of the Exploration Ratio Based on the Orientation of Equilibrium in Multi-Agent Reinforcement Learning Under Non-Stationary Environments.

Takuya OkanoItsuki Noda
Published in: J. Adv. Comput. Intell. Intell. Informatics (2017)
Keyphrases
  • multi agent reinforcement learning
  • reinforcement learning
  • dynamic programming
  • multi agent
  • learning environment
  • multi agent systems
  • computational complexity
  • state space
  • learning strategies