Pareto Regret Analyses in Multi-objective Multi-armed Bandit.
Mengfan Xu, Diego Klabjan
Published in: CoRR (2022)
Keyphrases
- multi objective
- multi armed bandit
- multi armed bandits
- multi objective optimization
- evolutionary algorithm
- multiobjective optimization
- optimization algorithm
- regret bounds
- reinforcement learning
- genetic algorithm
- multiple objectives
- objective function
- nsga ii
- decentralized decision making
- pareto optimal
- online learning
- pareto optimal solutions
- multi agent