Differentiable Meta-Learning of Bandit Policies.
Craig Boutilier, Chih-Wei Hsu, Branislav Kveton, Martin Mladenov, Csaba Szepesvári, Manzil Zaheer
Published in: NeurIPS (2020)
Keyphrases
- meta learning
- inductive learning
- meta knowledge
- learning tasks
- model selection
- machine learning algorithms
- decision trees
- machine learning
- feature selection
- data mining
- base classifiers
- multi-armed bandit problems
- metamodel
- transfer learning
- multi-class
- learning algorithm
- learning process
- expert systems
- e-learning
- artificial intelligence
- data sets