Adversarial Constrained Bidding via Minimax Regret Optimization with Causality-Aware Reinforcement Learning.
Haozhe Wang
Chao Du
Panyan Fang
Li He
Liang Wang
Bo Zheng
Published in:
CoRR (2023)
Keyphrases
reinforcement learning
minimax regret
multi-agent
stochastic programming
utility elicitation
decision problems
preference elicitation
reward function
state space
optimal policy
data sets
incomplete information
utility function
robust optimization
supervised learning
NP-hard
bayesian networks