A Convex Relaxation Approach to Bayesian Regret Minimization in Offline Bandits.
Mohammad GhavamzadehMarek PetrikGuy TennenholtzPublished in: CoRR (2023)
Keyphrases
- convex relaxation
- regret minimization
- convex optimization
- globally optimal
- multi label
- multistage
- nash equilibrium
- game theoretic
- multiple kernel learning
- optimization methods
- sparse approximation
- multi agent learning
- bayesian networks
- image processing
- genetic algorithm
- learning algorithm
- graph cuts
- higher order
- image features
- upper bound