Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning.

Published in: CoRR (2024)

Keyphrases