Han Zhao
ORCID
Publication Activity (10 Years)
Years Active: 2016-2024
Publications (10 Years): 15
Top Topics
Efficient Processing
Resource Management
Memory Bandwidth
Credit Card Fraud Detection
Top Venues
CoRR
IEEE Trans. Computers
Trustcom/BigDataSE/ISPA
ASPLOS (3)
Publications
Chuhao Xu, Yiyu Liu, Zijun Li, Quan Chen, Han Zhao, Deze Zeng, Qian Peng, Xueqi Wu, Haifeng Zhao, Senbo Fu, Minyi Guo
FaaSMem: Improving Memory Efficiency of Serverless Computing with Memory Pool Architecture.
ASPLOS (3)
(2024)
Chunyu Xue, Weihao Cui, Han Zhao, Quan Chen, Shulai Zhang, Pengyu Yang, Jing Yang, Shaobo Li, Minyi Guo
A Codesign of Scheduling and Parallelization for Large Model Training in Heterogeneous Clusters.
CoRR
(2024)
Han Zhao, Weihao Cui, Quan Chen, Shulai Zhang, Zijun Li, Jingwen Leng, Chao Li, Deze Zeng, Minyi Guo
Towards Fast Setup and High Throughput of GPU Serverless Computing.
CoRR
(2024)
Han Zhao, Weihao Cui, Quan Chen, Minyi Guo
ISPA: Exploiting Intra-SM Parallelism in GPUs via Fine-Grained Resource Management.
IEEE Trans. Computers
72 (5) (2023)
Binghao Chen, Han Zhao, Weihao Cui, Yifu He, Shulai Zhang, Quan Chen, Zijun Li, Minyi Guo
Maximizing the Utilization of GPUs Used by Cloud Gaming through Adaptive Co-location with Combo.
SoCC
(2023)
Han Zhao, Weihao Cui, Quan Chen, Jingwen Leng, Deze Zeng, Minyi Guo
Improving Cluster Utilization Through Adaptive Resource Management for Deep Neural Network and CPU Jobs Colocation.
IEEE Trans. Computers
72 (12) (2023)
Weihao Cui, Han Zhao, Quan Chen, Hao Wei, Zirui Li, Deze Zeng, Chao Li, Minyi Guo
DVABatch: Diversity-aware Multi-Entry Multi-Exit Batching for Efficient Processing of DNN Services on GPUs.
USENIX Annual Technical Conference
(2022)
Han Zhao, Weihao Cui, Quan Chen, Youtao Zhang, Yanchao Lu, Chao Li, Jingwen Leng, Minyi Guo
Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS.
HPCA
(2022)
Weihao Cui, Quan Chen, Han Zhao, Mengze Wei, Xiaoxin Tang, Minyi Guo
E²bird: Enhanced Elastic Batch for Improving Responsiveness and Throughput of Deep Learning Services.
IEEE Trans. Parallel Distributed Syst.
32 (6) (2021)
Weihao Cui, Han Zhao, Quan Chen, Ningxin Zheng, Jingwen Leng, Jieru Zhao, Zhuo Song, Tao Ma, Yong Yang, Chao Li, Minyi Guo
Enable simultaneous DNN services based on deterministic operator overlap and precise latency prediction.
SC
(2021)
Han Zhao, Weihao Cui, Quan Chen, Jieru Zhao, Jingwen Leng, Minyi Guo
Exploiting Intra-SM Parallelism in GPUs via Persistent and Elastic Blocks.
ICCD
(2021)
Han Zhao, Weihao Cui, Quan Chen, Jingwen Leng, Kai Yu, Deze Zeng, Chao Li, Minyi Guo
CODA: Improving Resource Utilization by Slimming and Co-locating DNN and CPU Jobs.
ICDCS
(2020)
Han Zhao, Quan Chen, Yuxian Qiu, Ming Wu, Yao Shen, Jingwen Leng, Chao Li, Minyi Guo
Bandwidth and Locality Aware Task-stealing for Manycore Architectures with Bandwidth-Asymmetric Memory.
ACM Trans. Archit. Code Optim.
15 (4) (2019)
You Dai, Jin Yan, Xiaoxin Tang, Han Zhao, Minyi Guo
Online Credit Card Fraud Detection: A Hybrid Framework with Big Data Technologies.
Trustcom/BigDataSE/ISPA
(2016)
Shanshan Chen, Xiaoxin Tang, Hongwei Wang, Han Zhao, Minyi Guo
Towards Scalable and Reliable In-Memory Storage System: A Case Study with Redis.
Trustcom/BigDataSE/ISPA
(2016)