DFX: A Low-latency Multi-FPGA Appliance for Accelerating Transformer-based Text Generation.
Seongmin Hong
Seungjae Moon
Junsoo Kim
Sungjae Lee
Minsub Kim
Dongsoo Lee
Joo-Young Kim
Published in:
HCS (2022)
Keyphrases
low latency
text generation
high speed
real time
natural language generation
high bandwidth
high throughput
massive scale
highly efficient
natural language
virtual machine
stream processing
data acquisition
continuous query processing
data flow
machine translation
query optimization