In-context Reinforcement Learning with Algorithm Distillation.
Michael LaskinLuyu WangJunhyuk OhEmilio ParisottoStephen SpencerRichie SteigerwaldDJ StrouseSteven HansenAngelos FilosEthan A. BrooksMaxime GazeauHimanshu SahniSatinder SinghVolodymyr MnihPublished in: CoRR (2022)
Keyphrases
- learning algorithm
- optimization algorithm
- computational cost
- improved algorithm
- preprocessing
- dynamic programming
- experimental evaluation
- reinforcement learning
- detection algorithm
- computationally efficient
- model free
- recognition algorithm
- ant colony optimization
- simulated annealing
- high accuracy
- worst case
- similarity measure
- np hard
- probabilistic model
- theoretical analysis
- k means
- computational complexity
- monte carlo
- data sets
- function approximation
- optimal solution
- matching algorithm
- tree structure
- clustering method
- cost function
- particle swarm optimization
- linear programming