Belief State Actor-Critic Algorithm from Separation Principle for POMDP.
Yujie Yang, Yuxuan Jiang, Jianyu Chen, Shengbo Eben Li, Ziqing Gu, Yuming Yin, Qian Zhang, Kai Yu
Published in: ACC (2023)
Keyphrases
- belief state
- learning algorithm
- objective function
- point-based value iteration
- partially observable Markov decision processes
- dynamic programming
- computational complexity
- probabilistic model
- reinforcement learning
- search space
- NP-hard
- linear programming
- parameter estimation
- Monte Carlo
- Kalman filter
- optimal solution
- machine learning