Login / Signup

A reinforcement learning method using a dynamic reinforcement function based on action selection probability.

Yugo HasegawaSatoko TakadaHidehiro NakanoShuichi AraiArata Miyauchi
Published in: Systems and Computers in Japan (2007)
Keyphrases