Sign in
Ali Hadi Zadeh
ORCID
Publication Activity (10 Years)
Years Active: 2019-2022
Publications (10 Years): 10
Top Topics
Efficient Inference
Language Model
Low Latency
Neural Network Training
Top Venues
CoRR
MICRO
IISWC
ISCA
</>
Publications
</>
Ali Hadi Zadeh
,
Mostafa Mahmoud
,
Ameer Abdelhadi
,
Andreas Moshovos
Mokey: Enabling Narrow Fixed-Point Inference for Out-of-the-Box Floating-Point Transformer Models.
CoRR
(2022)
Milos Nikolic
,
Enrique Torres-Sánchez
,
Jiahui Wang
,
Ali Hadi Zadeh
,
Mostafa Mahmoud
,
Ameer Abdelhadi
,
Andreas Moshovos
Schrödinger's FP: Dynamic Adaptation of Floating-Point Containers for Deep Learning Training.
CoRR
(2022)
Ali Hadi Zadeh
,
Mostafa Mahmoud
,
Ameer Abdelhadi
,
Andreas Moshovos
Mokey: enabling narrow fixed-point inference for out-of-the-box floating-point transformer models.
ISCA
(2022)
Omar Mohamed Awad
,
Mostafa Mahmoud
,
Isak Edo
,
Ali Hadi Zadeh
,
Ciaran Bannon
,
Anand Jayarajan
,
Gennady Pekhimenko
,
Andreas Moshovos
FPRaker: A Processing Element For Accelerating Neural Network Training.
MICRO
(2021)
Mostafa Mahmoud
,
Isak Edo Vivancos
,
Ali Hadi Zadeh
,
Omar Mohamed Awad
,
Gennady Pekhimenko
,
Jorge Albericio
,
Andreas Moshovos
TensorDash: Exploiting Sparsity to Accelerate Deep Neural Network Training and Inference.
CoRR
(2020)
Mostafa Mahmoud
,
Isak Edo
,
Ali Hadi Zadeh
,
Omar Mohamed Awad
,
Gennady Pekhimenko
,
Jorge Albericio
,
Andreas Moshovos
TensorDash: Exploiting Sparsity to Accelerate Deep Neural Network Training.
MICRO
(2020)
Ali Hadi Zadeh
,
Andreas Moshovos
GOBO: Quantizing Attention-Based NLP Models for Low Latency and Energy Efficient Inference.
CoRR
(2020)
Ali Hadi Zadeh
,
Isak Edo
,
Omar Mohamed Awad
,
Andreas Moshovos
GOBO: Quantizing Attention-Based NLP Models for Low Latency and Energy Efficient Inference.
MICRO
(2020)
Omar Mohamed Awad
,
Mostafa Mahmoud
,
Isak Edo Vivancos
,
Ali Hadi Zadeh
,
Ciaran Bannon
,
Anand Jayarajan
,
Gennady Pekhimenko
,
Andreas Moshovos
FPRaker: A Processing Element For Accelerating Neural Network Training.
CoRR
(2020)
Ali Hadi Zadeh
,
Zissis Poulos
,
Andreas Moshovos
Deep Learning Language Modeling Workloads: Where Time Goes on Graphics Processors.
IISWC
(2019)