Continuous-time q-Learning for Jump-Diffusion Models under Tsallis Entropy.
Lijun Bo, Yijie Huang, Xiang Yu, Tingting Zhang
Published in: CoRR (2024)
Keyphrases
- NP-hard
- diffusion models
- Markov chain
- state space
- diffusion model
- information diffusion
- reinforcement learning
- social networks
- steady state
- influence maximization
- learning algorithm
- optimal policy
- dynamical systems
- dynamic programming
- greedy algorithm
- optimal control
- viral marketing
- random walk
- anisotropic diffusion
- image sequences
- information systems