​
Login / Signup
Yi Yang
ORCID
Publication Activity (10 Years)
Years Active: 2010-2022
Publications (10 Years): 11
Top Topics
Stochastic Gradient Descent
Neural Network Training
Memory Bandwidth
Data Placement
Top Venues
CoRR
ICS
SC
LCPC
</>
Publications
</>
Yi Yang
,
Murugan Sankaradas
,
Srimat Chakradhar
DyCo: Dynamic, Contextualized AI Models.
ACM Trans. Embed. Comput. Syst.
21 (6) (2022)
Biplob Debnath
,
Giuseppe Coviello
,
Yi Yang
,
Srimat Chakradhar
UAC: An Uncertainty-Aware Face Clustering Algorithm.
ICCVW
(2021)
Giuseppe Coviello
,
Yi Yang
,
Kunal Rao
,
Srimat T. Chakradhar
Magic-Pipe: self-optimizing video analytics pipelines.
Middleware
(2021)
Kunal Rao
,
Giuseppe Coviello
,
Min Feng
,
Biplob Debnath
,
Wang-Pin Hsiung
,
Murugan Sankaradass
,
Yi Yang
,
Oliver Po
,
Utsav Drolia
,
Srimat T. Chakradhar
S: Free Flow Fever Screening.
SMARTCOMP
(2021)
Kunal Rao
,
Giuseppe Coviello
,
Min Feng
,
Biplob Debnath
,
Wang-Pin Hsiung
,
Murugan Sankaradass
,
Yi Yang
,
Oliver Po
,
Utsav Drolia
,
Srimat T. Chakradhar
F3S: Free Flow Fever Screening.
CoRR
(2021)
Linnan Wang
,
Yi Yang
,
Renqiang Min
,
Srimat T. Chakradhar
Accelerating deep neural network training with inconsistent stochastic gradient descent.
Neural Networks
93 (2017)
Chao Li
,
Yi Yang
,
Min Feng
,
Srimat T. Chakradhar
,
Huiyang Zhou
Optimizing memory efficiency for deep convolutional neural networks on GPUs.
SC
(2016)
Yi Yang
,
Min Feng
,
Srimat T. Chakradhar
HppCnn: A High-Performance, Portable Deep-Learning Library for GPGPUs.
ICPP
(2016)
Linnan Wang
,
Wei Wu
,
Zenglin Xu
,
Jianxiong Xiao
,
Yi Yang
BLASX: A High Performance Level-3 BLAS Library for Heterogeneous Multi-GPU Computing.
ICS
(2016)
Chao Li
,
Yi Yang
,
Min Feng
,
Srimat T. Chakradhar
,
Huiyang Zhou
Optimizing Memory Efficiency for Deep Convolutional Neural Networks on GPUs.
CoRR
(2016)
Linnan Wang
,
Yi Yang
,
Martin Renqiang Min
,
Srimat T. Chakradhar
Accelerating Deep Neural Network Training with Inconsistent Stochastic Gradient Descent.
CoRR
(2016)
Ping Xiang
,
Yi Yang
,
Mike Mantor
,
Norm Rubin
,
Huiyang Zhou
Revisiting ILP Designs for Throughput-Oriented GPGPU Architecture.
CCGRID
(2015)
Linnan Wang
,
Wei Wu
,
Jianxiong Xiao
,
Yi Yang
BLASX: A High Performance Level-3 BLAS Library for Heterogeneous Multi-GPU Computing.
CoRR
(2015)
Bin Ren
,
Nishkam Ravi
,
Yi Yang
,
Min Feng
,
Gagan Agrawal
,
Srimat T. Chakradhar
Automatic and Efficient Data Host-Device Communication for Many-Core Coprocessors.
LCPC
(2015)
Yi Yang
,
Chao Li
,
Huiyang Zhou
CUDA-NP: Realizing Nested Thread-Level Parallelism in GPGPU Applications.
J. Comput. Sci. Technol.
30 (1) (2015)
Chao Li
,
Yi Yang
,
Zhen Lin
,
Huiyang Zhou
Automatic data placement into GPU on-chip memory resources.
CGO
(2015)
Bin Ren
,
Nishkam Ravi
,
Yi Yang
,
Min Feng
,
Gagan Agrawal
,
Srimat T. Chakradhar
Automating and optimizing data transfers for many-core coprocessors.
ICS
(2014)
Chao Li
,
Yi Yang
,
Hongwen Dai
,
Shengen Yan
,
Frank Mueller
,
Huiyang Zhou
Understanding the tradeoffs between software-managed vs. hardware-managed caches in GPUs.
ISPASS
(2014)
Linhai Song
,
Min Feng
,
Nishkam Ravi
,
Yi Yang
,
Srimat T. Chakradhar
COMP: Compiler Optimizations for Manycore Processors.
MICRO
(2014)
Yi Yang
,
Ping Xiang
,
Michael Mantor
,
Norman Rubin
,
Lisa R. Hsu
,
Qunfeng Dong
,
Huiyang Zhou
A Case for a Flexible Scalar Unit in SIMT Architecture.
IPDPS
(2014)
Ping Xiang
,
Yi Yang
,
Huiyang Zhou
Warp-level divergence in GPUs: Characterization, impact, and mitigation.
HPCA
(2014)
Yi Yang
,
Huiyang Zhou
CUDA-NP: realizing nested thread-level parallelism in GPGPU applications.
PPOPP
(2014)
Nishkam Ravi
,
Yi Yang
,
Tao Bao
,
Srimat T. Chakradhar
Semi-automatic restructuring of offloadable tasks for many-core accelerators.
SC
(2013)
Yi Yang
,
Huiyang Zhou
The Implementation of a High Performance GPGPU Compiler.
Int. J. Parallel Program.
41 (6) (2013)
Saurabh Gupta
,
Ping Xiang
,
Yi Yang
,
Huiyang Zhou
Locality principle revisited: A probability-based quantitative approach.
J. Parallel Distributed Comput.
73 (7) (2013)
Ping Xiang
,
Yi Yang
,
Mike Mantor
,
Norm Rubin
,
Lisa R. Hsu
,
Huiyang Zhou
Exploiting uniform vector instructions for GPGPU performance, energy efficiency, and opportunistic reliability enhancement.
ICS
(2013)
Yi Yang
,
Ping Xiang
,
Mike Mantor
,
Huiyang Zhou
Fixing Performance Bugs: An Empirical Study of Open-Source GPGPU Programs.
ICPP
(2012)
Nishkam Ravi
,
Yi Yang
,
Tao Bao
,
Srimat T. Chakradhar
Apricot: an optimizing compiler and productivity tool for x86-compatible many-core coprocessors.
ICS
(2012)
Ping Xiang
,
Yi Yang
,
Mike Mantor
,
Norm Rubin
,
Huiyang Zhou
Many-thread aware instruction-level parallelism: architecting shader cores for GPU computing.
PACT
(2012)
Yi Yang
,
Ping Xiang
,
Mike Mantor
,
Norm Rubin
,
Huiyang Zhou
Shared memory multiplexing: a novel way to improve GPGPU throughput.
PACT
(2012)
Saurabh Gupta
,
Ping Xiang
,
Yi Yang
,
Huiyang Zhou
Locality Principle Revisited: A Probability-Based Quantitative Approach.
IPDPS
(2012)
Yi Yang
,
Ping Xiang
,
Mike Mantor
,
Huiyang Zhou
CPU-assisted GPGPU on fused CPU-GPU architectures.
HPCA
(2012)
Yi Yang
,
Ping Xiang
,
Jingfei Kong
,
Mike Mantor
,
Huiyang Zhou
A unified optimizing compiler framework for different GPGPU architectures.
ACM Trans. Archit. Code Optim.
9 (2) (2012)
Yi Yang
,
Ping Xiang
,
Jingfei Kong
,
Huiyang Zhou
A GPGPU compiler for memory optimization and parallelism management.
PLDI
(2010)
Jingfei Kong
,
Martin Dimitrov
,
Yi Yang
,
Janaka Liyanage
,
Lin Cao
,
Jacob Staples
,
Mike Mantor
,
Huiyang Zhou
Accelerating MATLAB Image Processing Toolbox functions on GPUs.
GPGPU
(2010)
Yi Yang
,
Ping Xiang
,
Jingfei Kong
,
Huiyang Zhou
An optimizing compiler for GPGPU programs with input-data sharing.
PPOPP
(2010)