Login / Signup
Cris Cecka
ORCID
Publication Activity (10 Years)
Years Active: 2013-2024
Publications (10 Years): 7
Top Topics
General Purpose
Parallel Computation
Matrix Multiplication
Graphics Processing Units
Top Venues
CoRR
ASPLOS (3)
PPoPP
ACM Trans. Archit. Code Optim.
</>
Publications
</>
Khalid Ahmad
,
Cris Cecka
,
Michael Garland
,
Mary W. Hall
Exploring Data Layout for Sparse Tensor Times Dense Matrix on GPUs.
ACM Trans. Archit. Code Optim.
21 (1) (2024)
Bastian Hagedorn
,
Bin Fan
,
Hanfeng Chen
,
Cris Cecka
,
Michael Garland
,
Vinod Grover
Graphene: An IR for Optimized Tensor Computations on GPUs.
ASPLOS (3)
(2023)
Muhammad Osama
,
Duane Merrill
,
Cris Cecka
,
Michael Garland
,
John D. Owens
Stream-K: Work-Centric Parallel Decomposition for Dense Matrix-Matrix Multiplication on the GPU.
PPoPP
(2023)
Muhammad Osama
,
Duane Merrill
,
Cris Cecka
,
Michael Garland
,
John D. Owens
Stream-K: Work-centric Parallel Decomposition for Dense Matrix-Matrix Multiplication on the GPU.
CoRR
(2023)
Cris Cecka
Low communication FMM-accelerated FFT on GPUs.
SC
(2017)
Yang Shi
,
U. N. Niranjan
,
Animashree Anandkumar
,
Cris Cecka
Tensor Contractions with Extended BLAS Kernels on CPU and GPU.
HiPC
(2016)
Yang Shi
,
U. N. Niranjan
,
Animashree Anandkumar
,
Cris Cecka
Tensor Contractions with Extended BLAS Kernels on CPU and GPU.
CoRR
(2016)
Pierre-David Létourneau
,
Cris Cecka
,
Eric Darve
Cauchy Fast Multipole Method for General Analytic Kernels.
SIAM J. Sci. Comput.
36 (2) (2014)
Cris Cecka
,
Eric Darve
Fourier-Based Fast Multipole Method for the Helmholtz Equation.
SIAM J. Sci. Comput.
35 (1) (2013)
Cris Cecka
,
Simon K. Layton
FMMTL: FMM Template Library A Generalized Framework for Kernel Matrices.
ENUMATH
(2013)
Tommy MacWilliam
,
Cris Cecka
CrowdCL: Web-based volunteer computing with WebCL.
HPEC
(2013)