Blind Decision Making: Reinforcement Learning with Delayed Observations.
Mridul Agarwal, Vaneet Aggarwal
Published in: CoRR (2020)
Keyphrases
- reinforcement learning
- decision making
- action selection
- decision makers
- function approximation
- decision support system
- decision process
- fuzzy logic
- reinforcement learning algorithms
- multi agent
- state space
- Markov decision processes
- optimal policy
- information processing
- temporal difference
- robotic control
- transfer learning
- business intelligence
- decision support
- temporal difference learning
- supply chain
- autonomous learning
- transition model
- imprecise probabilities
- multi agent reinforcement learning
- real time
- model free
- evaluation function
- autonomous agents
- supervised learning
- dynamic programming
- multi agent systems
- machine learning
- data mining
- data sets