Login / Signup
Earning While Learning: An Adversarial Multi-Armed Bandit Based Real-Time Bidding Scheme in Deregulated Electricity Market.
Yufeng Wang
Bo Zhang
Jianhua Ma
Qun Jin
Published in:
IEEE Trans. Netw. Sci. Eng. (2022)
Keyphrases
</>
electricity markets
learning process
electric power
reinforcement learning
learning algorithm
unit commitment
upper bound