Adversarial Batch Inverse Reinforcement Learning: Learn to Reward from Imperfect Demonstration for Interactive Recommendation.

Published in: CoRR (2023)

Keyphrases