Parallel GEMM-based convolution for deep learning on multicore RISC-V processors.
Cristián RamírezAdrián CastellóHéctor MartínezEnrique S. Quintana-OrtíPublished in: J. Supercomput. (2024)
Keyphrases
- deep learning
- shared memory
- parallel programming
- instruction set
- level parallelism
- parallel algorithm
- parallel processing
- multicore processors
- distributed memory
- parallel computing
- high end
- parallel computation
- unsupervised learning
- unsupervised feature learning
- parallel execution
- message passing
- machine learning
- application specific
- multi core processors
- graphics processing units
- parallel architectures
- cell processor
- computer architecture
- processing units
- mental models
- image processing
- weakly supervised
- floating point
- parallel implementation
- mesh connected
- co occurrence
- information extraction