Effective reward function in discernment behavior reinforcement learning based on categorization progress.
Chyon Hae Kim, Yusuke Kon, Ricardo Navarro, Manabu Gouko, Yuichi Kobayashi
Published in: Humanoids (2016)
Keyphrases
- reward function
- reinforcement learning
- reinforcement learning algorithms
- Markov decision processes
- state space
- optimal policy
- policy search
- Markov decision process
- inverse reinforcement learning
- function approximation
- data mining
- state action
- model free
- learning agent
- partially observable
- social networks
- machine learning