Login / Signup

Greedy Ordering of Layer Weight Matrices in Transformers Improves Translation.

Elicia Ye
Published in: CoRR (2023)
Keyphrases
  • weight matrices
  • weight matrix
  • dynamic programming
  • search space
  • pattern recognition
  • object recognition
  • probabilistic model