Reinforcement Learning with Budget-Constrained Nonparametric Function Approximation for Opportunistic Spectrum Access.
Theodoros TsiligkaridisDavid RomeroPublished in: GlobalSIP (2018)
Keyphrases
- function approximation
- reinforcement learning
- temporal difference learning
- model free
- mountain car
- function approximators
- temporal difference
- temporal difference learning algorithms
- learning tasks
- tile coding
- state action space
- reinforcement learning algorithms
- radial basis function
- machine learning
- temporal difference methods
- multi agent
- data mining
- td learning
- neural network
- markov decision processes
- reinforcement learning methods
- e learning
- action selection
- supervised learning
- transfer learning
- hyperplane