Login / Signup

Linear Latent World Models in Simple Transformers: A Case Study on Othello-GPT.

Dean S. HazinehZechen ZhangJeffery Chiu
Published in: CoRR (2023)
Keyphrases
  • evaluation function
  • probabilistic model
  • temporal difference learning
  • least squares
  • latent variables
  • game playing
  • game tree search
  • learning algorithm
  • case study
  • dynamic programming
  • upper bound
  • game tree