Distributed No-Regret Learning for Multi-Stage Systems with End-to-End Bandit Feedback.

Published in: CoRR (2024)

Keyphrases

end to end
multistage
distributed systems
online learning
learning algorithm
reinforcement learning
real time
peer to peer
ad hoc networks
stochastic optimization
response time
wireless ad hoc networks
admission control
congestion control