Resetting the Optimizer in Deep RL: An Empirical Study.
Kavosh AsadiRasool FakoorShoham SabachPublished in: NeurIPS (2023)
Keyphrases
- reinforcement learning
- markov decision processes
- optimization algorithm
- optimal policy
- query optimization
- learning algorithm
- state space
- action selection
- model free
- multi agent
- learning process
- data sets
- learning agents
- action space
- reinforcement learning algorithms
- supervised learning
- case study
- e learning
- decision making
- machine learning
- data mining