Stochastic online optimization. Single-point and multi-point non-linear multi-armed bandits. Convex and strongly-convex case.
Alexander V. Gasnikov, Ekaterina A. Krymova, Anastasia A. Lagunovskaya, Ilnura N. Usmanova, Fedor A. Fedorenko
Published in: Autom. Remote. Control. (2017)