Blind Decision Making: Reinforcement Learning with Delayed Observations.
Mridul AgarwalVaneet AggarwalPublished in: ICAPS (2021)
Keyphrases
- reinforcement learning
- decision making
- action selection
- decision makers
- function approximation
- decision support system
- decision support
- fuzzy logic
- multi agent reinforcement learning
- model free
- reasoning and decision making
- multi criteria
- information processing
- supply chain
- machine learning
- state space
- bounded rationality
- temporal difference learning
- reinforcement learning algorithms
- temporal difference
- data mining
- robotic control
- support systems
- learning algorithm
- multi agent
- expert systems
- search space
- markov decision processes
- optimal policy
- learning process
- dynamic programming