Haojun Xia
Publication Activity (10 Years)
Years Active: 2021-2024
Publications (10 Years): 14
Top Topics
Multithreading
SAT Solvers
Pattern Matching
Neural Network Training
Top Venues
CoRR
ISPA/BDCloud/SocialCom/SustainCom
CSCWD
MICRO
Publications
Haojun Xia, Zhen Zheng, Xiaoxia Wu, Shiyang Chen, Zhewei Yao, Stephen Youn, Arash Bakhtiari, Michael Wyatt, Donglin Zhuang, Zhongzhu Zhou, Olatunji Ruwase, Yuxiong He, Shuaiwen Leon Song. FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design. CoRR (2024)
Lei Gong, Chao Wang, Haojun Xia, Xianglan Chen, Xi Li, Xuehai Zhou. Enabling Fast and Memory-Efficient Acceleration for Pattern Matching Workloads: The Lightweight Automata Processing Engine. IEEE Trans. Computers 72 (4) (2023)
Haojun Xia, Zhen Zheng, Yuchao Li, Donglin Zhuang, Zhongzhu Zhou, Xiafei Qiu, Yong Li, Wei Lin, Shuaiwen Leon Song. Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity. CoRR (2023)
Haojun Xia, Zhen Zheng, Yuchao Li, Donglin Zhuang, Zhongzhu Zhou, Xiafei Qiu, Yong Li, Wei Lin, Shuaiwen Leon Song. Flash-LLM: Enabling Low-Cost and Highly-Efficient Large Generative Model Inference With Unstructured Sparsity. Proc. VLDB Endow. 17 (2) (2023)
Xiaoxia Wu, Haojun Xia, Stephen Youn, Zhen Zheng, Shiyang Chen, Arash Bakhtiari, Michael Wyatt, Reza Yazdani Aminabadi, Yuxiong He, Olatunji Ruwase, Leon Song, Zhewei Yao. ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks. CoRR (2023)
Hongwei Liu, Haojun Xia, Bibo Tu. Secure and Efficient BMC-Based Centralized Management Method for Large-Scale Data Centers. HPCC/DSS/SmartCity/DependSys (2022)
Hongwei Liu, Haojun Xia, Bibo Tu, Da Zhang, Xiaotong Wang. A Secure and Efficient USB-based In-band Communication Interface between Host and BMC. ISPA/BDCloud/SocialCom/SustainCom (2022)
Qi Wang, Haojun Xia, Kun Zhang, Bibo Tu. Evaluation and Optimization on Virtualization Performance Cost under Semantic Gap. CSCWD (2022)
Kunli Lin, Haojun Xia, Kun Zhang, Bibo Tu. AddrArmor: An Address-based Runtime Code-reuse Attack Mitigation for Shared Objects at the Binary-level. ISPA/BDCloud/SocialCom/SustainCom (2021)
Haojun Xia, Lei Gong, Chao Wang, Xianglan Chen, Xuehai Zhou. LAP: A Lightweight Automata Processor for Pattern Matching Tasks. DATE (2021)
Xingyao Zhang, Haojun Xia, Donglin Zhuang, Hao Sun, Xin Fu, Michael B. Taylor, Shuaiwen Leon Song. η-LSTM: Co-Designing Highly-Efficient Large LSTM Training via Exploiting Memory-Saving and Architectural Design Opportunities. ISCA (2021)
Qiyu Wan, Haojun Xia, Xingyao Zhang, Lening Wang, Shuaiwen Leon Song, Xin Fu. Shift-BNN: Highly-Efficient Probabilistic Bayesian Neural Network Training via Memory-Friendly Pattern Retrieving. CoRR (2021)
Kunli Lin, Wenqing Liu, Kun Zhang, Haojun Xia, Bibo Tu. HyperKRP: A Kernel Runtime Security Architecture with A Tiny Hypervisor on Commodity Hardware. GLOBECOM (2021)
Qiyu Wan, Haojun Xia, Xingyao Zhang, Lening Wang, Shuaiwen Leon Song, Xin Fu. Shift-BNN: Highly-Efficient Probabilistic Bayesian Neural Network Training via Memory-Friendly Pattern Retrieving. MICRO (2021)