Yaosheng Fu
Publication Activity (10 Years)
Years Active: 2014-2023
Publications (10 Years): 13
Top Topics
Deep Learning
Parallel Processing
Open Source
Diagnostic Problem Solving
Top Venues
CoRR
IEEE Micro
HPCA
ASPLOS
Publications
Yaosheng Fu, Evgeny Bolotin, Aamer Jaleel, Gal Dalal, Shie Mannor, Jacob Subag, Noam Korem, Michael Behar, David W. Nellans
AutoScratch: ML-Optimized Cache Management for Inference-Oriented GPUs.
MLSys (2023)
Yaosheng Fu, Evgeny Bolotin, Niladrish Chatterjee, David W. Nellans, Stephen W. Keckler
GPU Domain Specialization via Composable On-Package Architecture.
ACM Trans. Archit. Code Optim. 19 (1) (2022)
Yaosheng Fu, Evgeny Bolotin, Niladrish Chatterjee, David W. Nellans, Stephen W. Keckler
GPU Domain Specialization via Composable On-Package Architecture.
CoRR (2021)
Oreste Villa, Daniel Lustig, Zi Yan, Evgeny Bolotin, Yaosheng Fu, Niladrish Chatterjee, Nan Jiang, David W. Nellans
Need for Speed: Experiences Building a Trustworthy System-Level GPU Simulator.
HPCA (2021)
Jonathan Balkind, Katie Lim, Michael Schaffner, Fei Gao, Grigory Chirkov, Ang Li, Alexey Lavrov, Tri M. Nguyen, Yaosheng Fu, Florian Zaruba, Kunal Gulati, Luca Benini, David Wentzlaff
BYOC: A "Bring Your Own Core" Framework for Heterogeneous-ISA Research.
ASPLOS (2020)
Ahmet Fatih Inci, Evgeny Bolotin, Yaosheng Fu, Gal Dalal, Shie Mannor, David W. Nellans, Diana Marculescu
The Architectural Implications of Distributed Reinforcement Learning on CPU-GPU Systems.
CoRR (2020)
Saptadeep Pal, Eiman Ebrahimi, Arslan Zulfiqar, Yaosheng Fu, Victor Zhang, Szymon Migacz, David W. Nellans, Puneet Gupta
Optimizing Multi-GPU Parallelization Strategies for Deep Learning Training.
CoRR (2019)
Jonathan Balkind, Michael McKeown, Yaosheng Fu, Tri Minh Nguyen, Yanqi Zhou, Alexey Lavrov, Mohammad Shahrad, Adi Fuchs, Samuel Payne, Xiaohua Liang, Matthew Matl, David Wentzlaff
OpenPiton: an open source hardware platform for your research.
Commun. ACM 62 (12) (2019)
Saptadeep Pal, Eiman Ebrahimi, Arslan Zulfiqar, Yaosheng Fu, Victor Zhang, Szymon Migacz, David W. Nellans, Puneet Gupta
Optimizing Multi-GPU Parallelization Strategies for Deep Learning Training.
IEEE Micro 39 (5) (2019)
Michael McKeown, Alexey Lavrov, Mohammad Shahrad, Paul J. Jackson, Yaosheng Fu, Jonathan Balkind, Tri Minh Nguyen, Katie Lim, Yanqi Zhou, David Wentzlaff
Power and Energy Characterization of an Open Source 25-Core Manycore Processor.
HPCA (2018)
Michael McKeown, Yaosheng Fu, Tri Minh Nguyen, Yanqi Zhou, Jonathan Balkind, Alexey Lavrov, Mohammad Shahrad, Samuel Payne, David Wentzlaff
Piton: A Manycore Processor for Multitenant Clouds.
IEEE Micro 37 (2) (2017)
Michael McKeown, Yaosheng Fu, Tri Minh Nguyen, Yanqi Zhou, Jonathan Balkind, Alexey Lavrov, Mohammad Shahrad, Samuel Payne, David Wentzlaff
Piton: A 25-core academic manycore research processor.
Hot Chips Symposium (2016)
Jonathan Balkind, Michael McKeown, Yaosheng Fu, Tri Minh Nguyen, Yanqi Zhou, Alexey Lavrov, Mohammad Shahrad, Adi Fuchs, Samuel Payne, Xiaohua Liang, Matthew Matl, David Wentzlaff
OpenPiton: An Open Source Manycore Research Framework.
ASPLOS (2016)
Yaosheng Fu, Tri Minh Nguyen, David Wentzlaff
Coherence domain restriction on large scale systems.
MICRO (2015)
Yaosheng Fu, David Wentzlaff
PriME: A parallel and distributed simulator for thousand-core chips.
ISPASS (2014)