​
Ke Hong
Publication Activity (10 Years)
Years Active: 2012-2024
Publications (10 Years): 14
Top Topics
Efficient Inference
Point Cloud
N Gram
Language Model
Top Venues
CoRR
MLSys
ASPLOS (3)
MICRO
Publications
Ke Hong, Guohao Dai, Jiaming Xu, Qiuli Mao, Xiuhong Li, Jun Liu, Kangdi Chen, Yuhan Dong, Yu Wang
FlashDecoding++: Faster Large Language Model Inference with Asynchronization, Flat GEMM Optimization, and Heuristics.
MLSys (2024)
Kai Zhong, Zhenhua Zhu, Guohao Dai, Hongyi Wang, Xinhao Yang, Haoyu Zhang, Jin Si, Qiuli Mao, Shulin Zeng, Ke Hong, Genghan Zhang, Huazhong Yang, Yu Wang
FEASTA: A Flexible and Efficient Accelerator for Sparse Tensor Algebra in Machine Learning.
ASPLOS (3) (2024)
Zixuan Zhou, Xuefei Ning, Ke Hong, Tianyu Fu, Jiaming Xu, Shiyao Li, Yuming Lou, Luning Wang, Zhihang Yuan, Xiuhong Li, Shengen Yan, Guohao Dai, Xiao-Ping Zhang, Yuhan Dong, Yu Wang
A Survey on Efficient Inference for Large Language Models.
CoRR (2024)
Yaoxiu Lian, Xinhao Yang, Ke Hong, Yu Wang, Guohao Dai, Ningyi Xu
A Point Transformer Accelerator with Fine-Grained Pipelines and Distribution-Aware Dynamic FPS.
ICCAD (2023)
Yaofeng Tu, Jiahao Niu, Dezheng Wang, Hong Gao, Jin Xu, Ke Hong, Fang Yang
BDMasker: Dynamic Data Protection System for Open Big Data Environment.
Int. J. Softw. Informatics 13 (1) (2023)
Haotian Tang, Shang Yang, Zhijian Liu, Ke Hong, Zhongming Yu, Xiuyu Li, Guohao Dai, Yu Wang, Song Han
TorchSparse++: Efficient Point Cloud Engine.
CVPR Workshops (2023)
Tianchen Zhao, Xuefei Ning, Ke Hong, Zhongyuan Qiu, Pu Lu, Yali Zhao, Linfeng Zhang, Lipu Zhou, Guohao Dai, Huazhong Yang, Yu Wang
Ada3D : Exploiting the Spatial Redundancy with Adaptive Inference for Efficient 3D Object Detection.
CoRR (2023)
Xinhao Yang, Tianyu Fu, Guohao Dai, Shulin Zeng, Kai Zhong, Ke Hong, Yu Wang
An Efficient Accelerator for Point-based and Voxel-based Point Cloud Neural Networks.
DAC (2023)
Haotian Tang, Shang Yang, Zhijian Liu, Ke Hong, Zhongming Yu, Xiuyu Li, Guohao Dai, Yu Wang, Song Han
TorchSparse++: Efficient Training and Inference Framework for Sparse Convolution on GPUs.
CoRR (2023)
Ke Hong, Zhongming Yu, Guohao Dai, Xinhao Yang, Yaoxiu Lian, Zehao Liu, Ningyi Xu, Yuhan Dong, Yu Wang
Exploiting Hardware Utilization and Adaptive Dataflow for Efficient Sparse Convolution in 3D Point Clouds.
MLSys (2023)
Ke Hong, Guohao Dai, Jiaming Xu, Qiuli Mao, Xiuhong Li, Jun Liu, Kangdi Chen, Yuhan Dong, Yu Wang
FlashDecoding++: Faster Large Language Model Inference on GPUs.
CoRR (2023)
Haotian Tang, Shang Yang, Zhijian Liu, Ke Hong, Zhongming Yu, Xiuyu Li, Guohao Dai, Yu Wang, Song Han
TorchSparse++: Efficient Training and Inference Framework for Sparse Convolution on GPUs.
MICRO (2023)
Tianchen Zhao, Xuefei Ning, Ke Hong, Zhongyuan Qiu, Pu Lu, Yali Zhao, Linfeng Zhang, Lipu Zhou, Guohao Dai, Huazhong Yang, Yu Wang
Ada3D : Exploiting the Spatial Redundancy with Adaptive Inference for Efficient 3D Object Detection.
ICCV (2023)
Ke Hong, Tianyu Wang, Junchen Liu, Yu Wang, Yuan Shen
A Learning-Based AoA Estimation Method for Device-Free Localization.
IEEE Commun. Lett. 26 (6) (2022)
Ke Hong, Shuo Yang, Zhiqiang Ma, Lin Gu
A Synergy of the Wireless Sensor Network and the Data Center System.
MASS (2013)
Zhiqiang Ma, Ke Hong, Lin Gu
VOLUME: Enable Large-Scale In-Memory Computation on Commodity Clusters.
CloudCom (1) (2013)
Shuo Yang, Ke Hong, Lin Gu
Poster Abstract: Involving a Sensor Network System in Core Datacenter Management Functions.
ICCPS (2012)