Proposal and evaluation of deep exploitation-oriented learning under multiple reward environment.
Kazuteru MiyazakiPublished in: Cogn. Syst. Res. (2021)
Keyphrases
- reinforcement learning
- learning process
- learning systems
- learning algorithm
- dynamic environments
- real time
- deep architectures
- multiple tasks
- learning agent
- action selection
- combining multiple
- learning scheme
- learning analytics
- unsupervised learning
- multi agent systems
- training data
- genetic algorithm
- machine learning