High-performance optimizations on tiled many-core embedded systems: a matrix multiplication case study.
Arslan Munir, Farinaz Koushanfar, Ann Gordon-Ross, Sanjay Ranka
Published in: J. Supercomput. (2013)
Keyphrases
- embedded systems
- matrix multiplication
- case study
- distributed memory
- low cost
- embedded devices
- computing power
- embedded software
- message passing
- processing power
- real time systems
- resource limited
- software systems
- matrix factorization
- real time image processing
- hardware software
- embedded real time systems
- field programmable gate array
- parallel implementation
- hw sw
- communication technologies
- software development
- real world
- shared memory
- hardware and software
- open source
- recommender systems
- image processing
- real time
- scheduling problem
- high resolution
- protocol stack