Learning non-Markovian Decision-Making from State-only Sequences.
Aoyang QinFeng GaoQing LiSong-Chun ZhuSirui XiePublished in: NeurIPS (2023)
Keyphrases
- decision making
- learning algorithm
- reinforcement learning
- learning process
- decision support system
- learning scheme
- learning systems
- reinforcement learning agents
- prior knowledge
- decision makers
- supervised learning
- knowledge acquisition
- decision support
- dynamic programming
- learning experience
- markov decision processes
- hidden markov models