Ao Ren
ORCID
Publication Activity (10 Years)
Years Active: 2016-2024
Publications (10 Years): 73
Top Topics
Incremental Update
Heterogeneous Systems
Alternating Direction
Neural Network
Top Venues
CoRR
ICCD
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst.
ISVLSI
Publications
Yujuan Tan
,
Zhuoxin Bai
,
Duo Liu
,
Zhaoyang Zeng
,
Yan Gan
,
Ao Ren
,
Xianzhang Chen
,
Kan Zhong
BGS: Accelerate GNN training on multiple GPUs.
J. Syst. Archit.
153 (2024)
Ruiqing Lei
,
Xianzhang Chen
,
Duo Liu
,
Chunlin Song
,
Yujuan Tan
,
Ao Ren
CEIU: Consistent and Efficient Incremental Update mechanism for mobile systems on flash storage.
J. Syst. Archit.
152 (2024)
Chengliang Wang
,
Zhetong Huang
,
Ao Ren
,
Xun Zhang
An FPGA-based kNN Search Accelerator for point cloud registration.
ISCAS
(2024)
Kan Zhong
,
Zhiwang Yu
,
Qiao Li
,
Xianqiang Luo
,
Linbo Long
,
Yujuan Tan
,
Ao Ren
,
Duo Liu
DPC: DPU-accelerated High-Performance File System Client.
ICPP
(2024)
Chunlin Song
,
Xianzhang Chen
,
Duo Liu
,
Jiali Li
,
Yujuan Tan
,
Ao Ren
Optimizing the Performance of Consistency-Aware Deduplication Using Persistent Memory.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst.
43 (6) (2024)
Jing Yu
,
Yujuan Tan
,
Ao Ren
,
Duo Liu
Trustworthy Self-Attention: Enabling the Network to Focus Only on the Most Relevant References.
CoRR
(2024)
Jing Yu
,
Yujuan Tan
,
Ao Ren
,
Duo Liu
YOIO: You Only Iterate Once by mining and fusing multiple necessary global information in the optical flow estimation.
CoRR
(2024)
Xun Zhang
,
Chengliang Wang
,
Xingquan Piao
,
Ao Ren
,
Zhetong Huang
SCRA: Systolic-Friendly DNN Compression and Reconfigurable Accelerator Co-Design.
ISPA/BDCloud/SocialCom/SustainCom
(2023)
Lin Li
,
Xianzhang Chen
,
Jiali Li
,
Jiapin Wang
,
Duo Liu
,
Yujuan Tan
,
Ao Ren
Optimizing the Performance of NDP Operations by Retrieving File Semantics in Storage.
DAC
(2023)
Yi Chen
,
Ning Liu
,
Ao Ren
,
Tao Yang
,
Duo Liu
IFHE: Intermediate-Feature Heterogeneity Enhancement for Image Synthesis in Data-Free Knowledge Distillation.
DAC
(2023)
Ruiqing Lei
,
Xianzhang Chen
,
Duo Liu
,
Chunlin Song
,
Yujuan Tan
,
Ao Ren
Optimizing the Incremental Update Mechanism by Inlaying File Indexes on Flash Storage.
NVMSA
(2023)
Yuling Zhang
,
Ao Ren
,
Xianzhang Chen
,
Qiu Lin
,
Yujuan Tan
,
Duo Liu
Re-compact: Structured Pruning and SpMM Kernel Co-design for Accelerating DNNs on GPUs.
ICCD
(2023)
Guo Li
,
Xianzhang Chen
,
Duo Liu
,
Jiali Li
,
Yujuan Tan
,
Ao Ren
An Efficient Scheduling Algorithm for Multi-mode Tasks on Near-Data Processing SSDs.
ICA3PP (7)
(2023)
Xuehong Fan
,
Nanzhong Wu
,
Shukan Liu
,
Xianzhang Chen
,
Duo Liu
,
Yujuan Tan
,
Ao Ren
Data-Quality-Driven Federated Learning for Optimizing Communication Costs.
ICPADS
(2023)
Ao Ren
,
Yuhao Wang
,
Tao Zhang
,
Jiaxing Shi
,
Duo Liu
,
Xianzhang Chen
,
Yujuan Tan
,
Yuan Xie
HBP: Hierarchically Balanced Pruning and Accelerator Co-Design for Efficient DNN Inference.
DAC
(2023)
Jiali Li
,
Xianzhang Chen
,
Duo Liu
,
Ao Ren
,
Zhaoyang Zeng
,
Yujuan Tan
RadarSSD: A Computational Storage for Radar Signal Processing.
ICPP
(2023)
Yu Zhang
,
Duo Liu
,
Moming Duan
,
Li Li
,
Xianzhang Chen
,
Ao Ren
,
Yujuan Tan
,
Chengliang Wang
FedMDS: An Efficient Model Discrepancy-Aware Semi-Asynchronous Clustered Federated Learning Framework.
IEEE Trans. Parallel Distributed Syst.
34 (3) (2023)
Tianyu Xu
,
Xianzhang Chen
,
Changze Wu
,
Jiapin Wang
,
Rongwei Zheng
,
Duo Liu
,
Yujuan Tan
,
Ao Ren
,
Jian Li
3DS: An Efficient DPDK-based Data Distribution Service for Distributed Real-time Applications.
HPCC/DSS/SmartCity/DependSys
(2022)
Moming Duan
,
Duo Liu
,
Xinyuan Ji
,
Yu Wu
,
Liang Liang
,
Xianzhang Chen
,
Yujuan Tan
,
Ao Ren
Flexible Clustered Federated Learning for Client-Level Data Distribution Shift.
IEEE Trans. Parallel Distributed Syst.
33 (11) (2022)
Xin Li
,
Ao Ren
,
Yujuan Tan
,
Xusheng Li
,
Zhetong Huang
,
Chengliang Wang
,
Xianzhang Chen
,
Duo Liu
VEA: An FPGA-Based Voxel Encoding Accelerator for 3D Object Detection with LiDAR.
ICCD
(2022)
Li Li
,
Duo Liu
,
Moming Duan
,
Yu Zhang
,
Ao Ren
,
Xianzhang Chen
,
Yujuan Tan
,
Chengliang Wang
Federated learning with workload-aware client scheduling in heterogeneous systems.
Neural Networks
154 (2022)
Mengda Yang
,
Ziang Li
,
Juan Wang
,
Hongxin Hu
,
Ao Ren
,
Xiaoyang Xu
,
Wenzhe Yi
Measuring Data Reconstruction Defenses in Collaborative Inference Systems.
NeurIPS
(2022)
Rongwei Zheng
,
Xianzhang Chen
,
Duo Liu
,
Junjie Feng
,
Jiapin Wang
,
Ao Ren
,
Chengliang Wang
,
Yujuan Tan
SENTunnel: Fast Path for Sensor Data Access on Automotive Embedded Systems.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst.
41 (11) (2022)
Xiaofeng Ding
,
Chengliang Wang
,
Heping Liu
,
Zhihai Zhang
,
Xianzhang Chen
,
Yujuan Tan
,
Duo Liu
,
Ao Ren
FRL: Fast and Reconfigurable Accelerator for Distributed Sound Source Localization.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst.
41 (11) (2022)
Chunlin Song
,
Xianzhang Chen
,
Duo Liu
,
Xiaoliu Feng
,
Xi Yu
,
Jiali Li
,
Yujuan Tan
,
Ao Ren
CADedup: High-performance Consistency-aware Deduplication Based on Persistent Memory.
ICCD
(2022)
Geng Yuan
,
Zhiheng Liao
,
Xiaolong Ma
,
Yuxuan Cai
,
Zhenglun Kong
,
Xuan Shen
,
Jingyan Fu
,
Zhengang Li
,
Chengming Zhang
,
Hongwu Peng
,
Ning Liu
,
Ao Ren
,
Jinhui Wang
,
Yanzhi Wang
Improving DNN Fault Tolerance using Weight Pruning and Differential Crossbar Mapping for ReRAM-based Edge AI.
ISQED
(2021)
Yu Zhang
,
Moming Duan
,
Duo Liu
,
Li Li
,
Ao Ren
,
Xianzhang Chen
,
Yujuan Tan
,
Chengliang Wang
CSAFL: A Clustered Semi-Asynchronous Federated Learning Framework.
CoRR
(2021)
Jinshan Yue
,
Yongpan Liu
,
Ruoyang Liu
,
Wenyu Sun
,
Zhe Yuan
,
Yung-Ning Tu
,
Yi-Ju Chen
,
Ao Ren
,
Yanzhi Wang
,
Meng-Fan Chang
,
Xueqing Li
,
Huazhong Yang
STICKER-T: An Energy-Efficient Neural Network Processor Using Block-Circulant Algorithm and Unified Frequency-Domain Acceleration.
IEEE J. Solid State Circuits
56 (6) (2021)
Li Li
,
Moming Duan
,
Duo Liu
,
Yu Zhang
,
Ao Ren
,
Xianzhang Chen
,
Yujuan Tan
,
Chengliang Wang
FedSAE: A Novel Self-Adaptive Federated Learning Framework in Heterogeneous Systems.
CoRR
(2021)
Geng Yuan
,
Zhiheng Liao
,
Xiaolong Ma
,
Yuxuan Cai
,
Zhenglun Kong
,
Xuan Shen
,
Jingyan Fu
,
Zhengang Li
,
Chengming Zhang
,
Hongwu Peng
,
Ning Liu
,
Ao Ren
,
Jinhui Wang
,
Yanzhi Wang
Improving DNN Fault Tolerance using Weight Pruning and Differential Crossbar Mapping for ReRAM-based Edge AI.
CoRR
(2021)
Yu Zhang
,
Moming Duan
,
Duo Liu
,
Li Li
,
Ao Ren
,
Xianzhang Chen
,
Yujuan Tan
,
Chengliang Wang
CSAFL: A Clustered Semi-Asynchronous Federated Learning Framework.
IJCNN
(2021)
Li Li
,
Moming Duan
,
Duo Liu
,
Yu Zhang
,
Ao Ren
,
Xianzhang Chen
,
Yujuan Tan
,
Chengliang Wang
FedSAE: A Novel Self-Adaptive Federated Learning Framework in Heterogeneous Systems.
IJCNN
(2021)
Ao Ren
,
Tao Zhang
,
Yuhao Wang
,
Sheng Lin
,
Peiyan Dong
,
Yen-Kuang Chen
,
Yuan Xie
,
Yanzhi Wang
DARB: A Density-Adaptive Regular-Block Pruning for Deep Neural Networks.
AAAI
(2020)
Burak Kakillioglu
,
Ao Ren
,
Yanzhi Wang
,
Senem Velipasalar
3D Capsule Networks for Object Classification With Weight Pruning.
IEEE Access
8 (2020)
Ruizhe Cai
,
Ao Ren
,
Olivia Chen
,
Ning Liu
,
Caiwen Ding
,
Xuehai Qian
,
Jie Han
,
Wenhui Luo
,
Nobuyuki Yoshikawa
,
Yanzhi Wang
A Stochastic-Computing based Deep Learning Framework using Adiabatic Quantum-Flux-Parametron Superconducting Technology.
CoRR
(2019)
Zhe Li
,
Ji Li
,
Ao Ren
,
Ruizhe Cai
,
Caiwen Ding
,
Xuehai Qian
,
Jeffrey Draper
,
Bo Yuan
,
Jian Tang
,
Qinru Qiu
,
Yanzhi Wang
HEIF: Highly Efficient Stochastic Computing-Based Inference Framework for Deep Neural Networks.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst.
38 (8) (2019)
Ruizhe Cai
,
Olivia Chen
,
Ao Ren
,
Ning Liu
,
Caiwen Ding
,
Nobuyuki Yoshikawa
,
Yanzhi Wang
A Majority Logic Synthesis Framework for Adiabatic Quantum-Flux-Parametron Superconducting Circuits.
ACM Great Lakes Symposium on VLSI
(2019)
Ruizhe Cai
,
Xiaolong Ma
,
Olivia Chen
,
Ao Ren
,
Ning Liu
,
Nobuyuki Yoshikawa
,
Yanzhi Wang
IDE Development, Logic Synthesis and Buffer/Splitter Insertion Framework for Adiabatic Quantum-Flux-Parametron Superconducting Circuits.
ISVLSI
(2019)
Ao Ren
,
Tianyun Zhang
,
Shaokai Ye
,
Jiayu Li
,
Wenyao Xu
,
Xuehai Qian
,
Xue Lin
,
Yanzhi Wang
ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Methods of Multipliers.
ASPLOS
(2019)
Ruizhe Cai
,
Ao Ren
,
Olivia Chen
,
Ning Liu
,
Caiwen Ding
,
Xuehai Qian
,
Jie Han
,
Wenhui Luo
,
Nobuyuki Yoshikawa
,
Yanzhi Wang
A stochastic-computing based deep learning framework using adiabatic quantum-flux-parametron superconducting technology.
ISCA
(2019)
Jinshan Yue
,
Ruoyang Liu
,
Wenyu Sun
,
Zhe Yuan
,
Zhibo Wang
,
Yung-Ning Tu
,
Yi-Ju Chen
,
Ao Ren
,
Yanzhi Wang
,
Meng-Fan Chang
,
Xueqing Li
,
Huazhong Yang
,
Yongpan Liu
and 6T HBST-TRAM-Based 2D Data-Reuse Architecture.
ISSCC
(2019)
Ji Li
,
Zihao Yuan
,
Zhe Li
,
Ao Ren
,
Caiwen Ding
,
Jeffrey Draper
,
Shahin Nazarian
,
Qinru Qiu
,
Bo Yuan
,
Yanzhi Wang
Normalization and dropout for stochastic computing-based deep convolutional neural networks.
Integr.
65 (2019)
Ruizhe Cai
,
Olivia Chen
,
Ao Ren
,
Ning Liu
,
Nobuyuki Yoshikawa
,
Yanzhi Wang
A Buffer and Splitter Insertion Framework for Adiabatic Quantum-Flux-Parametron Superconducting Circuits.
ICCD
(2019)
Ao Ren
,
Tao Zhang
,
Yuhao Wang
,
Sheng Lin
,
Peiyan Dong
,
Yen-Kuang Chen
,
Yuan Xie
,
Yanzhi Wang
DARB: A Density-Aware Regular-Block Pruning for Deep Neural Networks.
CoRR
(2019)
Ruizhe Cai
,
Ao Ren
,
Ning Liu
,
Caiwen Ding
,
Luhao Wang
,
Xuehai Qian
,
Massoud Pedram
,
Yanzhi Wang
VIBNN: Hardware Acceleration of Bayesian Neural Networks.
CoRR
(2018)
Ruizhe Cai
,
Ao Ren
,
Ning Liu
,
Caiwen Ding
,
Luhao Wang
,
Xuehai Qian
,
Massoud Pedram
,
Yanzhi Wang
VIBNN: Hardware Acceleration of Bayesian Neural Networks.
ASPLOS
(2018)
Zhe Li
,
Ji Li
,
Ao Ren
,
Caiwen Ding
,
Jeffrey Draper
,
Qinru Qiu
,
Bo Yuan
,
Yanzhi Wang
Towards Budget-Driven Hardware Optimization for Deep Convolutional Neural Networks Using Stochastic Computing.
ISVLSI
(2018)
Caiwen Ding
,
Ao Ren
,
Geng Yuan
,
Xiaolong Ma
,
Jiayu Li
,
Ning Liu
,
Bo Yuan
,
Yanzhi Wang
Structured Weight Matrices-Based Hardware Accelerators in Deep Neural Networks: FPGAs and ASICs.
CoRR
(2018)
Xiaolong Ma
,
Yipeng Zhang
,
Geng Yuan
,
Ao Ren
,
Zhe Li
,
Jie Han
,
Jingtong Hu
,
Yanzhi Wang
An Area and Energy Efficient Design of Domain-Wall Memory-Based Deep Convolutional Neural Networks using Stochastic Computing.
CoRR
(2018)
Ruizhe Cai
,
Ao Ren
,
Sucheta Soundarajan
,
Yanzhi Wang
A low-computation-complexity, energy-efficient, and high-performance linear program solver based on primal-dual interior point method using memristor crossbars.
Nano Commun. Networks
18 (2018)
Xiaolong Ma
,
Yipeng Zhang
,
Geng Yuan
,
Ao Ren
,
Zhe Li
,
Jie Han
,
Jingtong Hu
,
Yanzhi Wang
An area and energy efficient design of domain-wall memory-based deep convolutional neural networks using stochastic computing.
ISQED
(2018)
Caiwen Ding
,
Ao Ren
,
Geng Yuan
,
Xiaolong Ma
,
Jiayu Li
,
Ning Liu
,
Bo Yuan
,
Yanzhi Wang
Structured Weight Matrices-Based Hardware Accelerators in Deep Neural Networks: FPGAs and ASICs.
ACM Great Lakes Symposium on VLSI
(2018)
Ao Ren
,
Tianyun Zhang
,
Shaokai Ye
,
Jiayu Li
,
Wenyao Xu
,
Xuehai Qian
,
Xue Lin
,
Yanzhi Wang
ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers.
CoRR
(2018)
Zhe Li
,
Ji Li
,
Ao Ren
,
Caiwen Ding
,
Jeffrey Draper
,
Qinru Qiu
,
Bo Yuan
,
Yanzhi Wang
Towards Budget-Driven Hardware Optimization for Deep Convolutional Neural Networks using Stochastic Computing.
CoRR
(2018)
Ao Ren
,
Sijia Liu
,
Ruizhe Cai
,
Wujie Wen
,
Pramod K. Varshney
,
Yanzhi Wang
Algorithm-Hardware Co-Optimization of the Memristor-Based Framework for Solving SOCP and Homogeneous QCQP Problems.
CoRR
(2018)
Ji Li
,
Zihao Yuan
,
Zhe Li
,
Caiwen Ding
,
Ao Ren
,
Qinru Qiu
,
Jeffrey Draper
,
Yanzhi Wang
Hardware-driven nonlinear activation for stochastic computing based deep convolutional neural networks.
IJCNN
(2017)
Ji Li
,
Ao Ren
,
Zhe Li
,
Caiwen Ding
,
Bo Yuan
,
Qinru Qiu
,
Yanzhi Wang
Towards acceleration of deep convolutional neural networks using stochastic computing.
ASP-DAC
(2017)
Ruizhe Cai
,
Ao Ren
,
Luhao Wang
,
Massoud Pedram
,
Yanzhi Wang
Hardware Acceleration of Bayesian Neural Networks Using RAM Based Linear Feedback Gaussian Random Number Generators.
ICCD
(2017)
Zihao Yuan
,
Ji Li
,
Zhe Li
,
Caiwen Ding
,
Ao Ren
,
Bo Yuan
,
Qinru Qiu
,
Jeffrey Draper
,
Yanzhi Wang
Softmax Regression Design for Stochastic Computing Based Deep Convolutional Neural Networks.
ACM Great Lakes Symposium on VLSI
(2017)
Ao Ren
,
Sijia Liu
,
Ruizhe Cai
,
Wujie Wen
,
Pramod K. Varshney
,
Yanzhi Wang
Algorithm-hardware co-optimization of the memristor-based framework for solving SOCP and homogeneous QCQP problems.
ASP-DAC
(2017)
Ji Li
,
Zihao Yuan
,
Zhe Li
,
Caiwen Ding
,
Ao Ren
,
Qinru Qiu
,
Jeffrey T. Draper
,
Yanzhi Wang
Hardware-Driven Nonlinear Activation for Stochastic Computing Based Deep Convolutional Neural Networks.
CoRR
(2017)
Ao Ren
,
Zhe Li
,
Caiwen Ding
,
Qinru Qiu
,
Yanzhi Wang
,
Ji Li
,
Xuehai Qian
,
Bo Yuan
SC-DCNN: Highly-Scalable Deep Convolutional Neural Network using Stochastic Computing.
ASPLOS
(2017)
Hongjia Li
,
Tianshu Wei
,
Ao Ren
,
Qi Zhu
,
Yanzhi Wang
Deep reinforcement learning: Framework, applications, and embedded implementations: Invited paper.
ICCAD
(2017)
Hongjia Li
,
Tianshu Wei
,
Ao Ren
,
Qi Zhu
,
Yanzhi Wang
Deep Reinforcement Learning: Framework, Applications, and Embedded Implementations.
CoRR
(2017)
Zhe Li
,
Ao Ren
,
Ji Li
,
Qinru Qiu
,
Bo Yuan
,
Jeffrey Draper
,
Yanzhi Wang
Structural design optimization for deep convolutional neural networks using stochastic computing.
DATE
(2017)
Sijia Liu
,
Ao Ren
,
Yanzhi Wang
,
Pramod K. Varshney
Ultra-fast robust compressive sensing based on memristor crossbars.
ICASSP
(2017)
Geng Yuan
,
Caiwen Ding
,
Ruizhe Cai
,
Xiaolong Ma
,
Ziyi Zhao
,
Ao Ren
,
Bo Yuan
,
Yanzhi Wang
Memristor crossbar-based ultra-efficient next-generation baseband processors.
MWSCAS
(2017)
Ao Ren
,
Zhe Li
,
Yanzhi Wang
,
Qinru Qiu
,
Bo Yuan
Designing reconfigurable large-scale deep learning systems using stochastic computing.
ICRC
(2016)
Ruizhe Cai
,
Ao Ren
,
Yanzhi Wang
,
Bo Yuan
Memristor-Based Discrete Fourier Transform for Improving Performance and Energy Efficiency.
ISVLSI
(2016)
Ruizhe Cai
,
Ao Ren
,
Yanzhi Wang
,
Sucheta Soundarajan
,
Qinru Qiu
,
Bo Yuan
,
Paul Bogdan
A low-computation-complexity, energy-efficient, and high-performance linear program solver using memristor crossbars.
SoCC
(2016)
Ao Ren
,
Bo Yuan
,
Yanzhi Wang
Design of high-speed low-power polar BP decoder using emerging technologies.
SoCC
(2016)
Zhe Li
,
Ao Ren
,
Ji Li
,
Qinru Qiu
,
Yanzhi Wang
,
Bo Yuan
DSCNN: Hardware-oriented optimization for Stochastic Computing based Deep Convolutional Neural Networks.
ICCD
(2016)
Ao Ren
,
Ji Li
,
Zhe Li
,
Caiwen Ding
,
Xuehai Qian
,
Qinru Qiu
,
Bo Yuan
,
Yanzhi Wang
SC-DCNN: Highly-Scalable Deep Convolutional Neural Network using Stochastic Computing.
CoRR
(2016)