Reinforcement Learning with Depreciating Assets.
Taylor Dohmen, Ashutosh Trivedi
Published in: CoRR (2023)
Keyphrases
- reinforcement learning
- function approximation
- reinforcement learning algorithms
- model free
- state space
- direct policy search
- learning algorithm
- learning capabilities
- information systems
- multi agent
- optimal policy
- Markov decision processes
- robot control
- financial markets
- reward function
- temporal difference
- optimal control
- real time
- transfer learning
- dynamic programming
- machine learning
- data mining