Login / Signup
Encode Once and Decode in Parallel: Efficient Transformer Decoding.
Bo-Ru Lu
Nikita Haduong
Chien-Yu Lin
Hao Cheng
Noah A. Smith
Mari Ostendorf
Published in:
CoRR (2024)
Keyphrases
</>
computationally efficient
cost effective
parallel implementation
real time
information retrieval
database systems
fuzzy logic
efficient implementation
parallel execution
machine learning
lightweight
power system
fault diagnosis
parallel algorithm