Login / Signup
: A Generalized Matrix Instruction Set for Accelerating Tensor Computation beyond GEMM.
Yunan Zhang
Po-An Tsai
Hung-Wei Tseng
Published in:
CoRR (2022)
Keyphrases
</>
instruction set
floating point
tensor factorization
application specific
computer architecture
high order
memory subsystem
information systems
nearest neighbor
general purpose
embedded systems
parallel computation
level parallelism
floating point arithmetic
instruction set architecture