Login / Signup
Ruslan Svirschevski
Publication Activity (10 Years)
Years Active: 2023-2024
Publications (10 Years): 4
Top Topics
Massively Parallel
Main Idea Consists
Compression Ratio
Data Compression
Top Venues
CoRR
ICLR
</>
Publications
</>
Tim Dettmers
,
Ruslan Svirschevski
,
Vage Egiazarian
,
Denis Kuznedelev
,
Elias Frantar
,
Saleh Ashkboos
,
Alexander Borzunov
,
Torsten Hoefler
,
Dan Alistarh
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression.
ICLR
(2024)
Zhuoming Chen
,
Avner May
,
Ruslan Svirschevski
,
Yuhsun Huang
,
Max Ryabinin
,
Zhihao Jia
,
Beidi Chen
Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding.
CoRR
(2024)
Ruslan Svirschevski
,
Avner May
,
Zhuoming Chen
,
Beidi Chen
,
Zhihao Jia
,
Max Ryabinin
SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices.
CoRR
(2024)
Tim Dettmers
,
Ruslan Svirschevski
,
Vage Egiazarian
,
Denis Kuznedelev
,
Elias Frantar
,
Saleh Ashkboos
,
Alexander Borzunov
,
Torsten Hoefler
,
Dan Alistarh
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression.
CoRR
(2023)