Sign in
Ningxin Zheng
ORCID
Publication Activity (10 Years)
Years Active: 2018-2023
Publications (10 Years): 24
Top Topics
Motion Field Estimation
Qos Aware
Memory Efficient
Iterative Deepening
Top Venues
CoRR
SC
OSDI
MobiSys
</>
Publications
</>
Weihao Cui
,
Zhenhua Han
,
Lingji Ouyang
,
Yichuan Wang
,
Ningxin Zheng
,
Lingxiao Ma
,
Yuqing Yang
,
Fan Yang
,
Jilong Xue
,
Lili Qiu
,
Lidong Zhou
,
Quan Chen
,
Haisheng Tan
,
Minyi Guo
Optimizing Dynamic Neural Networks with Brainstorm.
OSDI
(2023)
Ningxin Zheng
,
Huiqiang Jiang
,
Quanlu Zhang
,
Zhenhua Han
,
Lingxiao Ma
,
Yuqing Yang
,
Fan Yang
,
Chengruidong Zhang
,
Lili Qiu
,
Mao Yang
,
Lidong Zhou
PIT: Optimization of Dynamic Sparse Deep Learning Models via Permutation Invariant Transformation.
SOSP
(2023)
Li Lyna Zhang
,
Xudong Wang
,
Jiahang Xu
,
Quanlu Zhang
,
Yujing Wang
,
Yuqing Yang
,
Ningxin Zheng
,
Ting Cao
,
Mao Yang
SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference.
CoRR
(2023)
Xudong Wang
,
Li Lyna Zhang
,
Jiahang Xu
,
Quanlu Zhang
,
Yujing Wang
,
Yuqing Yang
,
Ningxin Zheng
,
Ting Cao
,
Mao Yang
SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference.
ICCV
(2023)
Guanghao Yin
,
Xinyang Jiang
,
Shan Jiang
,
Zhenhua Han
,
Ningxin Zheng
,
Huan Yang
,
Donglin Bai
,
Haisheng Tan
,
Shouqian Sun
,
Yuqing Yang
,
Dongsheng Li
,
Lili Qiu
Online Video Streaming Super-Resolution with Adaptive Look-Up Table Fusion.
CoRR
(2023)
Ningxin Zheng
,
Huiqiang Jiang
,
Quanlu Zhang
,
Zhenhua Han
,
Yuqing Yang
,
Lingxiao Ma
,
Fan Yang
,
Lili Qiu
,
Mao Yang
,
Lidong Zhou
SparDA: Accelerating Dynamic Sparse Deep Neural Networks via Sparse-Dense Transformation.
CoRR
(2023)
Jun Xiao
,
Xinyang Jiang
,
Ningxin Zheng
,
Huan Yang
,
Yifan Yang
,
Yuqing Yang
,
Dongsheng Li
,
Kin-Man Lam
Online Video Super-Resolution With Convolutional Kernel Bypass Grafts.
IEEE Trans. Multim.
25 (2023)
Xinyu Liu
,
Houwen Peng
,
Ningxin Zheng
,
Yuqing Yang
,
Han Hu
,
Yixuan Yuan
EfficientViT: Memory Efficient Vision Transformer with Cascaded Group Attention.
CVPR
(2023)
Xinyu Liu
,
Houwen Peng
,
Ningxin Zheng
,
Yuqing Yang
,
Han Hu
,
Yixuan Yuan
EfficientViT: Memory Efficient Vision Transformer with Cascaded Group Attention.
CoRR
(2023)
Ningxin Zheng
,
Bin Lin
,
Quanlu Zhang
,
Lingxiao Ma
,
Yuqing Yang
,
Fan Yang
,
Yang Wang
,
Mao Yang
,
Lidong Zhou
SparTA: Deep-Learning Model Sparsity via Tensor-with-Sparsity-Attribute.
OSDI
(2022)
Wei Zhang
,
Quan Chen
,
Ningxin Zheng
,
Weihao Cui
,
Kaihua Fu
,
Minyi Guo
Toward QoS-Awareness and Improved Utilization of Spatial Multitasking GPUs.
IEEE Trans. Computers
71 (4) (2022)
Jun Xiao
,
Xinyang Jiang
,
Ningxin Zheng
,
Huan Yang
,
Yifan Yang
,
Yuqing Yang
,
Dongsheng Li
,
Kin-Man Lam
Online Video Super-Resolution with Convolutional Kernel Bypass Graft.
CoRR
(2022)
Wei Zhang
,
Quan Chen
,
Kaihua Fu
,
Ningxin Zheng
,
Zhiyi Huang
,
Jingwen Leng
,
Minyi Guo
Astraea: towards QoS-aware and resource-efficient multi-stage GPU services.
ASPLOS
(2022)
Kaihua Fu
,
Jiuchen Shi
,
Quan Chen
,
Ningxin Zheng
,
Wei Zhang
,
Deze Zeng
,
Minyi Guo
QoS-Aware Irregular Collaborative Inference for Improving Throughput of DNN Services.
SC
(2022)
Li Lyna Zhang
,
Shihao Han
,
Jianyu Wei
,
Ningxin Zheng
,
Ting Cao
,
Yunxin Liu
nn-METER: Towards Accurate Latency Prediction of DNN Inference on Diverse Edge Devices.
GetMobile Mob. Comput. Commun.
25 (4) (2021)
Bo Li
,
Xinyang Jiang
,
Donglin Bai
,
Yuge Zhang
,
Ningxin Zheng
,
Xuanyi Dong
,
Lu Liu
,
Yuqing Yang
,
Dongsheng Li
Full-Cycle Energy Consumption Benchmark for Low-Carbon Computer Vision.
CoRR
(2021)
Li Lyna Zhang
,
Shihao Han
,
Jianyu Wei
,
Ningxin Zheng
,
Ting Cao
,
Yuqing Yang
,
Yunxin Liu
nn-Meter: towards accurate latency prediction of deep-learning model inference on diverse edge devices.
MobiSys
(2021)
Weihao Cui
,
Han Zhao
,
Quan Chen
,
Ningxin Zheng
,
Jingwen Leng
,
Jieru Zhao
,
Zhuo Song
,
Tao Ma
,
Yong Yang
,
Chao Li
,
Minyi Guo
Enable simultaneous DNN services based on deterministic operator overlap and precise latency prediction.
SC
(2021)
Wei Zhang
,
Kaihua Fu
,
Ningxin Zheng
,
Quan Chen
,
Chao Li
,
Wenli Zheng
,
Minyi Guo
CHARM: Collaborative Host and Accelerator Resource Management for GPU Datacenters.
ICCD
(2021)
Wei Zhang
,
Quan Chen
,
Kaihua Fu
,
Ningxin Zheng
,
Zhiyi Huang
,
Jingwen Leng
,
Chao Li
,
Wenli Zheng
,
Minyi Guo
Towards QoS-Aware and Resource-Efficient GPU Microservices Based on Spatial Multitasking GPUs In Datacenters.
CoRR
(2020)
Wei Zhang
,
Ningxin Zheng
,
Quan Chen
,
Yong Yang
,
Zhuo Song
,
Tao Ma
,
Jingwen Leng
,
Minyi Guo
URSA: Precise Capacity Planning and Fair Scheduling based on Low-level Statistics for Public Clouds.
ICPP
(2020)
Ningxin Zheng
,
Quan Chen
,
Yong Yang
,
Wei Zhang
,
Jin Li
,
Wenli Zheng
,
Minyi Guo
URSA: Precise Capacity Planning and Contention-aware Scheduling for Public Clouds.
CoRR
(2019)
Ningxin Zheng
,
Quan Chen
,
Yong Yang
,
Jin Li
,
Wenli Zheng
,
Minyi Guo
POSTER: Precise Capacity Planning for Database Public Clouds.
PACT
(2019)
Ningxin Zheng
,
Quan Chen
,
Chen Chen
,
Minyi Guo
CLIBE: Precise Cluster-Level I/O Bandwidth Enforcement in Distributed File System.
HPCC/SmartCity/DSS
(2018)