Tracking Drift-Plus-Penalty: Utility Maximization for Partially Observable and Controllable Networks.
Bai LiuQuang Minh NguyenQingkai LiangEytan H. ModianoPublished in: IEEE/ACM Trans. Netw. (2024)
Keyphrases
- partially observable
- utility maximization
- decision problems
- reinforcement learning
- state space
- markov decision processes
- utility function
- dynamical systems
- particle filter
- infinite horizon
- partial observations
- visual tracking
- stochastic gradient
- particle filtering
- belief state
- object tracking
- appearance model
- planning domains
- machine learning
- decision makers