Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces.
Toshihiro OtaPublished in: CoRR (2024)
Keyphrases
- reinforcement learning
- state space
- reinforcement learning algorithms
- decision makers
- markov decision processes
- learning algorithm
- decision making
- optimal policy
- goal state
- hidden state
- decision rules
- markov decision process
- model free
- function approximation
- machine learning
- markov chain
- supervised learning
- influence diagrams
- heuristic search
- case study
- robotic control