​
Login / Signup
Size Zheng
ORCID
Publication Activity (10 Years)
Years Active: 2020-2024
Publications (10 Years): 18
Top Topics
Bit Wise
Uniform Quantization
Garbage Collection
Graph Transformation
Top Venues
CoRR
ASPLOS (3)
ISCA
DAC
</>
Publications
</>
Renze Chen
,
Zijian Ding
,
Size Zheng
,
Chengrui Zhang
,
Jingwen Leng
,
Xuanzhe Liu
,
Yun Liang
MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN.
ASPLOS (3)
(2024)
Size Zheng
,
Renze Chen
,
Meng Li
,
Zihao Ye
,
Luis Ceze
,
Yun Liang
vMCU: Coordinated Memory Management and Kernel Optimization for DNN Inference on MCUs.
CoRR
(2024)
Liqiang Lu
,
Zizhang Luo
,
Size Zheng
,
Jieming Yin
,
Jason Cong
,
Yun Liang
,
Jianwei Yin
Rubick: A Unified Infrastructure for Analyzing, Exploring, and Implementing Spatial Architectures via Dataflow Decomposition.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst.
43 (4) (2024)
Yilong Zhao
,
Chien-Yu Lin
,
Kan Zhu
,
Zihao Ye
,
Lequn Chen
,
Size Zheng
,
Luis Ceze
,
Arvind Krishnamurthy
,
Tianqi Chen
,
Baris Kasikci
Atom: Low-Bit Quantization for Efficient and Accurate LLM Serving.
MLSys
(2024)
Size Zheng
,
Renze Chen
,
Meng Li
,
Zihao Ye
,
Luis Ceze
,
Yun Liang
vMCU: Coordinated Memory Management and Kernel Optimization for DNN Inference on MCUs.
MLSys
(2024)
Cong Li
,
Zhe Zhou
,
Size Zheng
,
Jiaxi Zhang
,
Yun Liang
,
Guangyu Sun
SpecPIM: Accelerating Speculative Inference on PIM-Enabled System via Architecture-Dataflow Co-Exploration.
ASPLOS (3)
(2024)
Size Zheng
,
Siyuan Chen
,
Peidi Song
,
Renze Chen
,
Xiuhong Li
,
Shengen Yan
,
Dahua Lin
,
Jingwen Leng
,
Yun Liang
Chimera: An Analytical Optimizing Framework for Effective Compute-intensive Operators Fusion.
HPCA
(2023)
Zizhang Luo
,
Liqiang Lu
,
Size Zheng
,
Jieming Yin
,
Jason Cong
,
Jianwei Yin
,
Yun Liang
Rubick: A Synthesis Framework for Spatial Architectures via Dataflow Decomposition.
DAC
(2023)
Xiuping Cui
,
Size Zheng
,
Tianyu Jia
,
Le Ye
,
Yun Liang
ARES: A Mapping Framework of DNNs Towards Diverse PIMs with General Abstractions.
ICCAD
(2023)
Yilong Zhao
,
Chien-Yu Lin
,
Kan Zhu
,
Zihao Ye
,
Lequn Chen
,
Size Zheng
,
Luis Ceze
,
Arvind Krishnamurthy
,
Tianqi Chen
,
Baris Kasikci
Atom: Low-bit Quantization for Efficient and Accurate LLM Serving.
CoRR
(2023)
Size Zheng
,
Siyuan Chen
,
Yun Liang
Memory and Computation Coordinated Mapping of DNNs onto Complex Heterogeneous SoC.
DAC
(2023)
Size Zheng
,
Siyuan Chen
,
Siyuan Gao
,
Liancheng Jia
,
Guangyu Sun
,
Runsheng Wang
,
Yun Liang
TileFlow: A Framework for Modeling Fusion Dataflow via Tree-based Analysis.
MICRO
(2023)
Size Zheng
,
Renze Chen
,
Yicheng Jin
,
Anjiang Wei
,
Bingyang Wu
,
Xiuhong Li
,
Shengen Yan
,
Yun Liang
NeoFlow: A Flexible Framework for Enabling Efficient Compilation for High Performance DNN Training.
IEEE Trans. Parallel Distributed Syst.
33 (11) (2022)
Size Zheng
,
Renze Chen
,
Anjiang Wei
,
Yicheng Jin
,
Qin Han
,
Liqiang Lu
,
Bingyang Wu
,
Xiuhong Li
,
Shengen Yan
,
Yun Liang
AMOS: enabling automatic mapping for tensor computations on spatial accelerators with hardware abstraction.
ISCA
(2022)
Qingcheng Xiao
,
Size Zheng
,
Bingzhe Wu
,
Pengcheng Xu
,
Xuehai Qian
,
Yun Liang
HASCO: Towards Agile HArdware and Software CO-design for Tensor Computation.
ISCA
(2021)
Qingcheng Xiao
,
Size Zheng
,
Bingzhe Wu
,
Pengcheng Xu
,
Xuehai Qian
,
Yun Liang
HASCO: Towards Agile HArdware and Software CO-design for Tensor Computation.
CoRR
(2021)
Yi-Hsiang Lai
,
Hongbo Rong
,
Size Zheng
,
Weihao Zhang
,
Xiuping Cui
,
Yunshan Jia
,
Jie Wang
,
Brendan Sullivan
,
Zhiru Zhang
,
Yun Liang
,
Youhui Zhang
,
Jason Cong
,
Nithin George
,
Jose Alvarez
,
Christopher J. Hughes
,
Pradeep Dubey
SuSy: A Programming Model for Productive Construction of High-Performance Systolic Arrays on FPGAs.
ICCAD
(2020)
Size Zheng
,
Yun Liang
,
Shuo Wang
,
Renze Chen
,
Kaiwen Sheng
FlexTensor: An Automatic Schedule Exploration and Optimization Framework for Tensor Computation on Heterogeneous System.
ASPLOS
(2020)