In-context Reinforcement Learning with Algorithm Distillation.

Michael Laskin Luyu Wang Junhyuk Oh Emilio Parisotto Stephen Spencer Richie Steigerwald DJ Strouse Steven Stenberg Hansen Angelos Filos Ethan A. Brooks Maxime Gazeau Himanshu Sahni Satinder Singh Volodymyr Mnih

Published in: ICLR (2023)

Keyphrases

reinforcement learning
learning algorithm
improved algorithm
times faster
theoretical analysis
computational complexity
optimal solution
cost function
high accuracy
preprocessing
detection algorithm
optimization algorithm
expectation maximization
experimental evaluation
dynamic programming
worst case
computationally efficient
multi agent
segmentation algorithm
objective function
probabilistic model
particle swarm optimization
k means
search space
tree structure
monte carlo
function approximation
neural network