Deep Reinforcement Learning-based Rebalancing Policies for Profit Maximization of Relay Nodes in Payment Channel Networks.
Nikolaos PapadisLeandros TassiulasPublished in: CoRR (2022)
Keyphrases
- profit maximization
- reinforcement learning
- optimal policy
- control policies
- relay nodes
- markov decision process
- multi agent
- markov decision processes
- cooperative
- reward function
- learning algorithm
- dynamic programming
- state space
- wireless networks
- network structure
- energy efficient
- temporal difference
- revenue management