Reward-Biased Maximum Likelihood Estimation for Neural Contextual Bandits: A Distributional Learning Perspective.
Yu-Heng HungPing-Chun HsiehPublished in: AAAI (2023)
Keyphrases
- maximum likelihood estimation
- learning algorithm
- reinforcement learning
- boltzmann machine
- multi armed bandits
- parameter estimation
- online learning
- probability distribution
- least squares
- neural network
- maximum likelihood
- em algorithm
- unsupervised learning
- feature extraction
- detection algorithm
- clustering algorithm
- computer vision
- data mining