Login / Signup
Cheng Li
ORCID
Publication Activity (10 Years)
Years Active: 2016-2020
Publications (10 Years): 23
Top Topics
Machine Learning Models
Weakly Supervised
Dt Mri
Deep Learning
Top Venues
CoRR
CLOUD
ICPE
IPDPS
</>
Publications
</>
Cheng Li
,
Abdul Dakkak
,
Jinjun Xiong
,
Wei Wei
,
Lingjie Xu
,
Wen-Mei Hwu
XSP: Across-Stack Profiling and Analysis of Machine Learning Models on GPUs.
IPDPS
(2020)
Abdul Dakkak
,
Cheng Li
,
Jinjun Xiong
,
Wen-Mei Hwu
DLSpec: A Deep Learning Task Exchange Specification.
CoRR
(2020)
Cheng Li
,
Abdul Dakkak
,
Jinjun Xiong
,
Wen-Mei Hwu
Benanza: Automatic μBenchmark Generation to Compute "Lower-bound" Latency and Inform Optimizations of Deep Learning Models on GPUs.
IPDPS
(2020)
Cheng Li
,
Abdul Dakkak
,
Jinjun Xiong
,
Wen-Mei W. Hwu
The Design and Implementation of a Scalable Deep Learning Benchmarking Platform.
CLOUD
(2020)
Abdul Dakkak
,
Cheng Li
,
Jinjun Xiong
,
Wen-Mei Hwu
MLModelScope: A Distributed Platform for Model Evaluation and Benchmarking at Scale.
CoRR
(2020)
Abdul Dakkak
,
Cheng Li
,
Jinjun Xiong
,
Wen-mei W. Hwu
DLSpec: A Deep Learning Task Exchange Specification.
OpML
(2020)
Cheng Li
,
Abdul Dakkak
,
Jinjun Xiong
,
Wen-Mei Hwu
DLBricks: Composable Benchmark Generation to Reduce Deep Learning Benchmarking Effort on CPUs.
ICPE
(2020)
Cheng Li
,
Abdul Dakkak
,
Jinjun Xiong
,
Wen-Mei W. Hwu
DLBricks: Composable Benchmark Generation to Reduce Deep Learning Benchmarking Effort on CPUs.
CoRR
(2019)
Abdul Dakkak
,
Cheng Li
,
Jinjun Xiong
,
Isaac Gelado
,
Wen-Mei W. Hwu
Accelerating reduction and scan using tensor core units.
ICS
(2019)
Carl Pearson
,
Abdul Dakkak
,
Sarah Hashash
,
Cheng Li
,
I-Hsin Chung
,
Jinjun Xiong
,
Wen-Mei Hwu
Evaluating Characteristics of CUDA Communication Primitives on High-Bandwidth Interconnects.
ICPE
(2019)
Cheng Li
,
Abdul Dakkak
,
Jinjun Xiong
,
Wen-Mei W. Hwu
The Design and Implementation of a Scalable DL Benchmarking Platform.
CoRR
(2019)
Cheng Li
,
Abdul Dakkak
,
Jinjun Xiong
,
Wen-Mei Hwu
Benanza: Automatic μBenchmark Generation to Compute "Lower-bound" Latency and Inform Optimizations of Deep Learning Models on GPUs.
CoRR
(2019)
Wei Zhang
,
Wei Wei
,
Lingjie Xu
,
Lingling Jin
,
Cheng Li
AI Matrix: A Deep Learning Benchmark for Alibaba Data Centers.
CoRR
(2019)
Abdul Dakkak
,
Cheng Li
,
Simon Garcia De Gonzalo
,
Jinjun Xiong
,
Wen-Mei Hwu
TrIMS: Transparent and Isolated Model Sharing for Low Latency Deep Learning Inference in Function-as-a-Service.
CLOUD
(2019)
Cheng Li
,
Abdul Dakkak
,
Jinjun Xiong
,
Wen-Mei Hwu
MLModelScope: Evaluate and Introspect Cognitive Pipelines.
SERVICES
(2019)
Cheng Li
,
Abdul Dakkak
,
Jinjun Xiong
,
Wei Wei
,
Lingjie Xu
,
Wen-Mei Hwu
Across-Stack Profiling and Characterization of Machine Learning Models on GPUs.
CoRR
(2019)
Cheng Li
,
Abdul Dakkak
,
Jinjun Xiong
,
Wen-Mei Hwu
Challenges and Pitfalls of Reproducing Machine Learning Artifacts.
CoRR
(2019)
Carl Pearson
,
Abdul Dakkak
,
Cheng Li
,
Sarah Hashash
,
Jinjun Xiong
,
Wen-Mei W. Hwu
SCOPE: C3SR Systems Characterization and Benchmarking Framework.
CoRR
(2018)
Abdul Dakkak
,
Cheng Li
,
Isaac Gelado
,
Jinjun Xiong
,
Wen-Mei W. Hwu
Accelerating Reduction and Scan Using Tensor Core Units.
CoRR
(2018)
Abdul Dakkak
,
Cheng Li
,
Simon Garcia De Gonzalo
,
Jinjun Xiong
,
Wen-Mei W. Hwu
TrIMS: Transparent and Isolated Model Sharing for Low Latency Deep LearningInference in Function as a Service Environments.
CoRR
(2018)
Abdul Dakkak
,
Cheng Li
,
Abhishek Srivastava
,
Jinjun Xiong
,
Wen-Mei W. Hwu
MLModelScope: Evaluate and Measure ML Models within AI Pipelines.
CoRR
(2018)
Abdul Dakkak
,
Carl Pearson
,
Cheng Li
,
Wen-mei W. Hwu
RAI: A Scalable Project Submission System for Parallel Programming Courses.
IPDPS Workshops
(2017)
Izzat El Hajj
,
Juan Gómez-Luna
,
Cheng Li
,
Li-Wen Chang
,
Dejan S. Milojicic
,
Wen-mei W. Hwu
KLAP: Kernel launch aggregation and promotion for optimizing dynamic parallelism.
MICRO
(2016)