The Option Keyboard: Combining Skills in Reinforcement Learning.
André Barreto, Diana Borsa, Shaobo Hou, Gheorghe Comanici, Eser Aygün, Philippe Hamel, Daniel Toyama, Jonathan J. Hunt, Shibl Mourad, David Silver, Doina Precup
Published in: CoRR (2021)
Keyphrases
- reinforcement learning
- function approximation
- learning algorithm
- machine learning
- multi agent reinforcement learning
- data sets
- temporal difference learning
- model free
- error rate
- user interface
- learning process
- information technology
- optimal policy
- multi agent
- combining multiple
- website
- reinforcement learning algorithms
- information systems
- genetic algorithm