Login / Signup
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning.
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert D. Nowak
Published in:
CoRR (2024)
Keyphrases
</>
multi task
learning tasks
multi task learning
reinforcement learning
learning problems
multiple tasks
multitask learning
active learning
feature selection
decision trees
learning process
multi class
unsupervised learning
gaussian processes
data sets
mutual information