Dual Policy Reinforcement Learning for Real-time Rebalancing in Bike-sharing Systems.
Jiaqi Liang
Defeng Liu
Sanjay Dominik Jena
Andrea Lodi
Thibaut Vidal
Published in:
CoRR (2024)
Keyphrases
real time
reinforcement learning
optimal policy
function approximation
telecommunication systems
distributed systems
multi agent
management system
low cost
vision system
control system
real time systems
robotic control
dynamic programming
computer systems
information sharing
data structure
transition model