Login / Signup
Andrés E. Tomás
ORCID
Publication Activity (10 Years)
Years Active: 2017-2024
Publications (10 Years): 21
Top Topics
Deep Learning
Sparse Matrix
Gram Schmidt Orthogonalization
Singular Value Decomposition
Top Venues
CoRR
J. Supercomput.
J. Syst. Archit.
Concurr. Comput. Pract. Exp.
</>
Publications
</>
Andrés E. Tomás
,
Enrique S. Quintana-Ortí
,
Hartwig Anzt
Fast Truncated SVD of Sparse and Dense Matrices on Graphics Processors.
CoRR
(2024)
Manuel F. Dolz
,
Sergio Barrachina
,
Héctor Martínez
,
Adrián Castelló
,
Antonio-Manuel Vidal-Maciá
,
Germán Fabregat
,
Andrés E. Tomás
Performance-energy trade-offs of deep learning convolution algorithms on ARM processors.
J. Supercomput.
79 (9) (2023)
José Ignacio Aliaga
,
Hartwig Anzt
,
Enrique S. Quintana-Ortí
,
Andrés E. Tomás
Sparse matrix-vector and matrix-multivector products for the truncated SVD on graphics processors.
Concurr. Comput. Pract. Exp.
35 (28) (2023)
Andrés E. Tomás
,
Enrique S. Quintana-Ortí
,
Hartwig Anzt
Fast truncated SVD of sparse and dense matrices on graphics processors.
Int. J. High Perform. Comput. Appl.
37 (3-4) (2023)
José Ignacio Aliaga
,
Hartwig Anzt
,
Thomas Grützmacher
,
Enrique S. Quintana-Ortí
,
Andrés E. Tomás
Compressed basis GMRES on high-performance graphics processing units.
Int. J. High Perform. Comput. Appl.
37 (2) (2023)
Sergio Barrachina
,
Adrián Castelló
,
Manuel F. Dolz
,
Tze Meng Low
,
Héctor Martínez
,
Enrique S. Quintana-Ortí
,
Upasana Sridhar
,
Andrés E. Tomás
Reformulating the direct convolution for high-performance deep learning inference on ARM processors.
J. Syst. Archit.
135 (2023)
Andrés E. Tomás
,
Enrique S. Quintana-Ortí
Tall-and-Skinny QR Factorization for Clusters of GPUs Using High-Performance Building Blocks.
Euro-Par Workshops (1)
(2023)
José Ignacio Aliaga
,
Hartwig Anzt
,
Thomas Grützmacher
,
Enrique S. Quintana-Ortí
,
Andrés E. Tomás
Compression and load balancing for efficient sparse matrix-vector product on multicore processors and graphics processing units.
Concurr. Comput. Pract. Exp.
34 (14) (2022)
Adrián Castelló
,
Sergio Barrachina
,
Manuel F. Dolz
,
Enrique S. Quintana-Ortí
,
Pau San Juan
,
Andrés E. Tomás
High performance and energy efficient inference for deep learning on multicore ARM processors using general optimization techniques and BLIS.
J. Syst. Archit.
125 (2022)
Sergio Barrachina
,
Adrián Castelló
,
Manuel F. Dolz
,
Andrés E. Tomás
BestOf: an online implementation selector for the training and inference of deep neural networks.
J. Supercomput.
78 (16) (2022)
José Ignacio Aliaga
,
Hartwig Anzt
,
Enrique S. Quintana-Ortí
,
Andrés E. Tomás
,
Yuhsiang M. Tsai
Balanced and Compressed Coordinate Layout for the Sparse Matrix-Vector Product on GPUs.
Euro-Par Workshops
(2020)
Andrés E. Tomás
,
Enrique S. Quintana-Ortí
Tall-and-skinny QR factorization with approximate Householder reflectors on graphics processors.
J. Supercomput.
76 (11) (2020)
José Ignacio Aliaga
,
Hartwig Anzt
,
Thomas Grützmacher
,
Enrique S. Quintana-Ortí
,
Andrés E. Tomás
Compressed Basis GMRES on High Performance GPUs.
CoRR
(2020)
Goran Flegar
,
Florian Scheidegger
,
Vedran Novakovic
,
Giovanni Mariani
,
Andrés E. Tomás
,
A. Cristiano I. Malossi
,
Enrique S. Quintana-Ortí
FloatX: A C++ Library for Customized Floating-Point Arithmetic.
ACM Trans. Math. Softw.
45 (4) (2019)
Rafael Rodríguez-Sánchez
,
Sandra Catalán
,
José R. Herrero
,
Enrique S. Quintana-Ortí
,
Andrés E. Tomás
Look-ahead in the two-sided reduction to compact band forms for symmetric eigenvalue problems and the SVD.
Numer. Algorithms
80 (2) (2019)
Andrés E. Tomás
,
Rafael Rodríguez-Sánchez
,
Sandra Catalán
,
Rocío Carratalá-Sáez
,
Enrique S. Quintana-Ortí
Dynamic look-ahead in the reduction to band form for the singular value decomposition.
Parallel Comput.
81 (2019)
Andrés E. Tomás
,
Enrique S. Quintana-Ortí
Cholesky and Gram-Schmidt Orthogonalization for Tall-and-Skinny QR Factorizations on Graphics Processors.
Euro-Par
(2019)
Andrés E. Tomás
,
Rafael Rodríguez-Sánchez
,
Sandra Catalán
,
Enrique S. Quintana-Ortí
Reduction to Band Form for the Singular Value Decomposition on Graphics Accelerators.
PMAM@PPoPP
(2018)
Hartwig Anzt
,
Goran Flegar
,
Vedran Novakovic
,
Enrique S. Quintana-Ortí
,
Andrés E. Tomás
Residual Replacement in Mixed-Precision Iterative Refinement for Sparse Linear Systems.
ISC Workshops
(2018)
Rafael Rodríguez-Sánchez
,
Sandra Catalán
,
José R. Herrero
,
Enrique S. Quintana-Ortí
,
Andrés E. Tomás
Two-Sided Reduction to Compact Band Forms with Look-Ahead.
CoRR
(2017)
Hartwig Anzt
,
Jack J. Dongarra
,
Goran Flegar
,
Enrique S. Quintana-Ortí
,
Andrés E. Tomás
Variable-Size Batched Gauss-Huard for Block-Jacobi Preconditioning.
ICCS
(2017)