Actor-Critic Deep Reinforcement Learning for Dynamic Multichannel Access.

Chen Zhong Ziyang Lu Mustafa Cenk Gursoy Senem Velipasalar

Published in: GlobalSIP (2018)

Keyphrases

actor critic
reinforcement learning
policy gradient
function approximation
temporal difference
optimal control
reinforcement learning algorithms
approximate dynamic programming
gradient method
policy iteration
neuro fuzzy
learning algorithm
dynamic environments
state space
control problems
average reward
temporal difference learning
multi agent
infinite horizon
learning problems
supervised learning
dynamic programming