Login / Signup

ε-Policy Gradient for Online Pricing.

Lukasz SzpruchTanut TreetanthiploetYufei Zhang
Published in: CoRR (2024)
Keyphrases
  • policy gradient
  • parametric optimization
  • reinforcement learning
  • function approximation
  • approximation methods