​
Login / Signup
Shiyi Cao
Publication Activity (10 Years)
Years Active: 2013-2024
Publications (10 Years): 14
Top Topics
Smoothing Methods
Language Model
Lightweight
Document Length
Top Venues
CoRR
OFC
Remote. Sens.
ExaMPI@SC
</>
Publications
</>
Ying Sheng
,
Shiyi Cao
,
Dacheng Li
,
Banghua Zhu
,
Zhuohan Li
,
Danyang Zhuo
,
Joseph E. Gonzalez
,
Ion Stoica
Fairness in Serving Large Language Models.
OSDI
(2024)
Shu Liu
,
Asim Biswal
,
Audrey Cheng
,
Xiangxi Mo
,
Shiyi Cao
,
Joseph E. Gonzalez
,
Ion Stoica
,
Matei Zaharia
Optimizing LLM Queries in Relational Workloads.
CoRR
(2024)
Ying Sheng
,
Shiyi Cao
,
Dacheng Li
,
Banghua Zhu
,
Zhuohan Li
,
Danyang Zhuo
,
Joseph E. Gonzalez
,
Ion Stoica
Fairness in Serving Large Language Models.
CoRR
(2024)
Byungsoo Jeon
,
Mengdi Wu
,
Shiyi Cao
,
Sunghyun Kim
,
Sunghyun Park
,
Neeraj Aggarwal
,
Colin Unger
,
Daiyaan Arfeen
,
Peiyuan Liao
,
Xupeng Miao
,
Mohammad Alizadeh
,
Gregory R. Ganger
,
Tianqi Chen
,
Zhihao Jia
GraphPipe: Improving Performance and Scalability of DNN Training with Graph Pipeline Parallelism.
CoRR
(2024)
Ying Sheng
,
Shiyi Cao
,
Dacheng Li
,
Coleman Hooper
,
Nicholas Lee
,
Shuo Yang
,
Christopher Chou
,
Banghua Zhu
,
Lianmin Zheng
,
Kurt Keutzer
,
Joseph Gonzalez
,
Ion Stoica
SLoRA: Scalable Serving of Thousands of LoRA Adapters.
MLSys
(2024)
Ling Yang
,
Zhaochen Yu
,
Tianjun Zhang
,
Shiyi Cao
,
Minkai Xu
,
Wentao Zhang
,
Joseph E. Gonzalez
,
Bin Cui
Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models.
CoRR
(2024)
Lianmin Zheng
,
Liangsheng Yin
,
Zhiqiang Xie
,
Jeff Huang
,
Chuyue Sun
,
Cody Hao Yu
,
Shiyi Cao
,
Christos Kozyrakis
,
Ion Stoica
,
Joseph E. Gonzalez
,
Clark W. Barrett
,
Ying Sheng
Efficiently Programming Large Language Models using SGLang.
CoRR
(2023)
Ying Sheng
,
Shiyi Cao
,
Dacheng Li
,
Coleman Hooper
,
Nicholas Lee
,
Shuo Yang
,
Christopher Chou
,
Banghua Zhu
,
Lianmin Zheng
,
Kurt Keutzer
,
Joseph E. Gonzalez
,
Ion Stoica
S-LoRA: Serving Thousands of Concurrent LoRA Adapters.
CoRR
(2023)
Shiyi Cao
,
Xijun Hu
,
Yezi Wang
,
Cunyou Chen
,
Dong Xu
,
Tingting Bai
Understanding Spatial-Temporal Interactions of Ecosystem Services and Their Drivers in a Multi-Scale Perspective of Miluo Using Multi-Source Remote Sensing Data.
Remote. Sens.
15 (14) (2023)
Yanze Wang
,
Tianyu Gao
,
Yaping Liu
,
Tao Xu
,
Wenbo Yu
,
Zhiqun Yang
,
Qiang Guo
,
Rui Zhou
,
Shiyi Cao
,
Xinhua Xiao
,
Lin Zhang
Novel Mirror-flipped Mode Permutation Technique for Long-haul Mode-division Multiplexing Transmissions.
OFC
(2022)
Xiuqi Huang
,
Shiyi Cao
,
Yuanning Gao
,
Xiaofeng Gao
,
Guihai Chen
LightPro: Lightweight Probabilistic Workload Prediction Framework for Database-as-a-Service.
ICWS
(2022)
Linbo Yang
,
Zhiqun Yang
,
Tao Xu
,
Lijie Hou
,
Rui Zhou
,
Lin Gan
,
Shiyi Cao
,
Xinhua Xiao
,
Lin Zhang
Low-loss Mode Field Adapter Using Reverse Tapering for Fundamental Mode Transmission over MMFs.
OFC
(2022)
Shiyi Cao
,
Salvatore Di Girolamo
,
Torsten Hoefler
Accelerating Data Serialization/Deserialization Protocols with In-Network Compute.
ExaMPI@SC
(2022)
Shiyi Cao
,
Yuanning Gao
,
Xiaofeng Gao
,
Guihai Chen
AdaM: An Adaptive Fine-Grained Scheme for Distributed Metadata Management.
ICPP
(2019)
Liangjia Zong
,
Han Zhao
,
Zhiyong Feng
,
Shiyi Cao
Demonstration of ultra-compact contentionless-ROADM based on flexible wavelength router.
ECOC
(2014)
Shiyi Cao
,
Feng Wang
,
Wilson Tam
,
Lap Ah Tse
,
Jean Hee Kim
,
Junan Liu
,
Zuxun Lu
A hybrid seasonal prediction model for tuberculosis incidence in China.
BMC Medical Informatics Decis. Mak.
13 (2013)
Bo Wu
,
Shaofeng Qiu
,
Zhiyong Feng
,
Shiyi Cao
,
Han Zhao
,
Junling Xiang
,
Chiwu Ding
,
Gordon Ning Liu
,
Ning Deng
,
Qianjin Xiong
Green and agile petabit optical sub-wavelength switching prototype for the future OTN multi-chassis switch cluster.
OFC/NFOEC
(2013)