Login / Signup

A Policy Adaptation Method for Implicit Multitask Reinforcement Learning Problems.

Satoshi YamamoriJun Morimoto
Published in: CoRR (2023)
Keyphrases
  • data mining
  • prior knowledge
  • multi task
  • reinforcement learning problems
  • pairwise
  • dynamic programming
  • unlabeled data
  • policy search