Login / Signup
Vector-Vector-Matrix Architecture: A Novel Hardware-Aware Framework for Low-Latency Inference in NLP Applications.
Matthew Khoury
Rumen Dangovski
Longwu Ou
Preslav Nakov
Yichen Shen
Li Jing
Published in:
EMNLP (1) (2020)
Keyphrases
</>
low latency
real time
high speed
computational complexity
low cost
high throughput
database
data mining
highly efficient