Parallel GEMM-based convolutions for deep learning on multicore ARM and RISC-V architectures.
Héctor Martínez, Sandra Catalán, Adrián Castelló, Enrique S. Quintana-Ortí
Published in: J. Syst. Archit. (2024)
Keyphrases
- deep learning
- shared memory
- multicore processors
- parallel programming
- level parallelism
- multi core processors
- unsupervised feature learning
- unsupervised learning
- parallel architectures
- instruction set
- weakly supervised
- parallel computing
- machine learning
- mental models
- message passing
- viewpoint
- computer architecture
- memory management
- graphics processing units
- floating point
- computer vision