Login / Signup
Vector-Vector-Matrix Architecture: A Novel Hardware-Aware Framework for Low-Latency Inference in NLP Applications.
Matthew Khoury
Rumen Dangovski
Longwu Ou
Preslav Nakov
Yichen Shen
Li Jing
Published in:
CoRR (2020)
Keyphrases
</>
low latency
real time
high throughput
low cost
high speed
data sets
database
databases
data mining
query optimization
data acquisition