​
Login / Signup
Yemao Xu
ORCID
Publication Activity (10 Years)
Years Active: 2016-2023
Publications (10 Years): 14
Top Topics
Deep Learning
Optimization Strategies
Alternating Least Squares
Neural Network Training
Top Venues
CoRR
ACM Trans. Archit. Code Optim.
J. Parallel Distributed Comput.
NAS
</>
Publications
</>
Yemao Xu
,
Dezun Dong
,
Dongsheng Wang
,
Shi Xu
,
Enda Yu
,
Weixia Xu
,
Xiangke Liao
SSD-SGD: Communication Sparsification for Distributed Deep Learning Training.
ACM Trans. Archit. Code Optim.
20 (1) (2023)
Enda Yu
,
Dezun Dong
,
Yemao Xu
,
Shuo Ouyang
,
Xiangke Liao
CP-SGD: Distributed stochastic gradient descent with compression and periodic compensation.
J. Parallel Distributed Comput.
169 (2022)
Enda Yu
,
Dezun Dong
,
Yemao Xu
,
Shuo Ouyang
,
Xiangke Liao
CD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation.
ICPP
(2021)
Enda Yu
,
Dezun Dong
,
Yemao Xu
,
Shuo Ouyang
,
Xiangke Liao
CD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation.
CoRR
(2021)
Shuo Ouyang
,
Dezun Dong
,
Yemao Xu
,
Liquan Xiao
Communication optimization strategies for distributed deep neural network training: A survey.
J. Parallel Distributed Comput.
149 (2021)
Yanghai Wang
,
Dezun Dong
,
Yemao Xu
,
Shuo Ouyang
,
Xiangke Liao
FastHorovod: Expediting Parallel Message-Passing Schedule for Distributed DNN Training.
ISCC
(2021)
Shuo Ouyang
,
Dezun Dong
,
Yemao Xu
,
Liquan Xiao
Communication Optimization Strategies for Distributed Deep Learning: A Survey.
CoRR
(2020)
Yemao Xu
,
Dezun Dong
,
Yawei Zhao
,
Weixia Xu
,
Xiangke Liao
OD-SGD: One-Step Delay Stochastic Gradient Descent for Distributed Training.
ACM Trans. Archit. Code Optim.
17 (4) (2020)
Yemao Xu
,
Dezun Dong
,
Weixia Xu
,
Xiangke Liao
OD-SGD: One-step Delay Stochastic Gradient Descent for Distributed Training.
CoRR
(2020)
Yemao Xu
,
Dezun Dong
,
Yawei Zhao
,
Weixia Xu
,
Xiangke Liao
ssd-sgd: communication sparsification for distributed deep learning training.
CoRR
(2020)
Yemao Xu
,
Dezun Dong
,
Weixia Xu
,
Xiangke Liao
SketchDLC: A Sketch on Distributed Deep Learning Communication via Trace Capturing.
ACM Trans. Archit. Code Optim.
16 (2) (2019)
Jingwei Chen
,
Li Shen
,
Zhiying Wang
,
Ning Li
,
Yemao Xu
Dynamic Power-Performance Adjustment on Clustered Multi-Threading Processors.
NAS
(2016)
Yemao Xu
,
Jialong Wang
,
Yanhong Liu
,
Li Shen
Fast Task Submission in Software Thread Level Speculation Systems.
Trustcom/BigDataSE/ISPA
(2016)
Ning Li
,
Li Shen
,
Qi Zhu
,
Yemao Xu
,
Jialong Wang
,
Zhiying Wang
An implementation of analytical power model on integrated GPU.
ISIC
(2016)