​
Login / Signup
Yulong Ao
Publication Activity (10 Years)
Years Active: 2015-2021
Publications (10 Years): 13
Top Topics
Constraint Solver
Highly Efficient
Input Vectors
Sat Solvers
Top Venues
ICPP
CoRR
Clust. Comput.
ACM Trans. Archit. Code Optim.
</>
Publications
</>
Yulong Ao
,
Zhihua Wu
,
Dianhai Yu
,
Weibao Gong
,
Zhiqing Kui
,
Minxu Zhang
,
Zilingfeng Ye
,
Liang Shen
,
Yanjun Ma
,
Tian Wu
,
Haifeng Wang
,
Wei Zeng
,
Chao Yang
End-to-end Adaptive Distributed Training on PaddlePaddle.
CoRR
(2021)
Min Li
,
Yulong Ao
,
Chao Yang
Adaptive SpMV/SpMSpV on GPUs for Input Vectors of Varied Sparsity.
IEEE Trans. Parallel Distributed Syst.
32 (7) (2021)
Peng Zhang
,
Chao Yang
,
Yulong Ao
AutoWM: a novel domain-specific tool for universal multi-/many-core accelerations of the WRF cloud microphysics.
Clust. Comput.
24 (2) (2021)
Min Li
,
Yulong Ao
,
Chao Yang
Adaptive SpMV/SpMSpV on GPUs for Input Vectors of Varied Sparsity.
CoRR
(2020)
Wenjing Ma
,
Yulong Ao
,
Chao Yang
,
Samuel Williams
Solving a trillion unknowns per second with HPGMG on Sunway TaihuLight.
Clust. Comput.
23 (2) (2020)
Min Li
,
Chao Yang
,
Qiao Sun
,
Wenjing Ma
,
Wenlong Cao
,
Yulong Ao
Enabling Highly Efficient k-Means Computations on the SW26010 Many-Core Processor of Sunway TaihuLight.
J. Comput. Sci. Technol.
34 (1) (2019)
Ying Cai
,
Chao Yang
,
Wenjing Ma
,
Yulong Ao
Extreme-Scale Realistic Stencil Computations on Sunway TaihuLight with Ten Million Cores.
CCGrid
(2018)
Ying Cai
,
Yulong Ao
,
Chao Yang
,
Wenjing Ma
,
Haitao Zhao
Extreme-Scale High-Order WENO Simulations of 3-D Detonation Wave with 10 Million Cores.
ACM Trans. Archit. Code Optim.
15 (2) (2018)
Xinliang Wang
,
Ping Xu
,
Wei Xue
,
Yulong Ao
,
Chao Yang
,
Haohuan Fu
,
Lin Gan
,
Guangwen Yang
,
Weimin Zheng
A Fast Sparse Triangular Solver for Structured-grid Problems on Sunway Many-core Processor SW26010.
ICPP
(2018)
Yulong Ao
,
Chao Yang
,
Fangfang Liu
,
Wanwang Yin
,
Lijuan Jiang
,
Qiao Sun
Performance Optimization of the HPCG Benchmark on the Sunway TaihuLight Supercomputer.
ACM Trans. Archit. Code Optim.
15 (1) (2018)
Yulong Ao
,
Chao Yang
,
Xinliang Wang
,
Wei Xue
,
Haohuan Fu
,
Fangfang Liu
,
Lin Gan
,
Ping Xu
,
Wenjing Ma
26 PFLOPS Stencil Computations for Atmospheric Modeling on Sunway TaihuLight.
IPDPS
(2017)
Lijuan Jiang
,
Chao Yang
,
Yulong Ao
,
Wanwang Yin
,
Wenjing Ma
,
Qiao Sun
,
Fangfang Liu
,
Rongfen Lin
,
Peng Zhang
Towards Highly Efficient DGEMM on the Emerging SW26010 Many-Core Processor.
ICPP
(2017)
Chao Yang
,
Wei Xue
,
Haohuan Fu
,
Hongtao You
,
Xinliang Wang
,
Yulong Ao
,
Fangfang Liu
,
Lin Gan
,
Ping Xu
,
Lanning Wang
,
Guangwen Yang
,
Weimin Zheng
10M-core scalable fully-implicit solver for nonhydrostatic atmospheric dynamics.
SC
(2016)
Peng Zhang
,
Yulong Ao
,
Chao Yang
,
Yiqung Liu
,
Fangfang Liu
,
Changmao Wu
,
Haitao Zhao
Pattern-Driven Hybrid Multi- and Many-Core Acceleration in the MPAS Shallow-Water Model.
ICPP
(2015)
Yulong Ao
,
Yiqung Liu
,
Chao Yang
,
Fangfang Liu
,
Peng Zhang
,
Yutong Lu
,
Yunfei Du
Performance Evaluation of HPGMG on Tianhe-2: Early Experience.
ICA3PP (4)
(2015)