Solving reward-collecting problems with UAVs: a comparison of online optimization and Q-learning.
Yixuan Liu, Chrysafis Vogiatzis, Ruriko Yoshida, Erich Morman
Published in: CoRR (2021)
Keyphrases
- combinatorial optimization
- optimization problems
- reinforcement learning
- discrete optimization
- nonlinear programming
- solving problems
- stochastic shortest path
- convex programming
- mathematical programming
- quadratic programming
- linear programming
- solving complex
- convex optimization problems
- optimization algorithm
- NP-complete
- online learning
- state space
- constrained optimization
- model-free
- variational inequalities
- robust optimization
- efficient algorithms for solving
- multi-agent