Arash Bakhtiari
Publication Activity (10 Years)
Years Active: 2016-2024
Publications (10 Years): 6
Top Topics
Graphics Processors
Mass Spectrometry
Text Generation
Proteomic Data
Top Venues
CoRR
SC
Comput.
Publications
Connor Holmes, Masahiro Tanaka, Michael Wyatt, Ammar Ahmad Awan, Jeff Rasley, Samyam Rajbhandari, Reza Yazdani Aminabadi, Heyang Qin, Arash Bakhtiari, Lev Kurilenko, Yuxiong He
DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference.
CoRR (2024)
Haojun Xia, Zhen Zheng, Xiaoxia Wu, Shiyang Chen, Zhewei Yao, Stephen Youn, Arash Bakhtiari, Michael Wyatt, Donglin Zhuang, Zhongzhu Zhou, Olatunji Ruwase, Yuxiong He, Shuaiwen Leon Song
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design.
CoRR (2024)
Xiaoxia Wu, Haojun Xia, Stephen Youn, Zhen Zheng, Shiyang Chen, Arash Bakhtiari, Michael Wyatt, Reza Yazdani Aminabadi, Yuxiong He, Olatunji Ruwase, Leon Song, Zhewei Yao
ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks.
CoRR (2023)
Tom Bannink, Arash Bakhtiari, Adam Hillier, Lukas Geiger, Tim de Bruin, Leon Overweel, Jelmer Neeven, Koen Helwegen
Larq Compute Engine: Design, Benchmark, and Deploy State-of-the-Art Binarized Neural Networks.
CoRR (2020)
Christoph Riesinger, Arash Bakhtiari, Martin Schreiber, Philipp Neumann, Hans-Joachim Bungartz
A Holistic Scalable Implementation Approach of the Lattice Boltzmann Method for CPU/GPU Heterogeneous Clusters.
Comput. 5 (4) (2017)
Arash Bakhtiari, Dhairya Malhotra, Amir Raoofy, Miriam Mehl, Hans-Joachim Bungartz, George Biros
A parallel arbitrary-order accurate AMR algorithm for the scalar advection-diffusion equation.
SC (2016)