Login / Signup
Greedy Ordering of Layer Weight Matrices in Transformers Improves Translation.
Elicia Ye
Published in:
CoRR (2023)
Keyphrases
</>
weight matrices
weight matrix
dynamic programming
search space
pattern recognition
object recognition
probabilistic model