Login / Signup
Adaptation Method of the Exploration Ratio Based on the Orientation of Equilibrium in Multi-Agent Reinforcement Learning Under Non-Stationary Environments.
Takuya Okano
Itsuki Noda
Published in:
J. Adv. Comput. Intell. Intell. Informatics (2017)
Keyphrases
</>
multi agent reinforcement learning
reinforcement learning
dynamic programming
multi agent
learning environment
multi agent systems
computational complexity
state space
learning strategies