Test-Time Regret Minimization in Meta Reinforcement Learning.
Mirco Mutti
Aviv Tamar
Published in:
CoRR (2024)
Keyphrases
reinforcement learning
regret minimization
supervised learning
neural network
machine learning
artificial intelligence
function approximation
meta level
policy search
domain knowledge
test cases
test data
Markov decision processes
robotic control