Flexible Communication Avoiding Matrix Multiplication on FPGA with High-Level Synthesis.
Johannes de Fine Licht, Grzegorz Kwasniewski, Torsten Hoefler
Published in: CoRR (2019)
Keyphrases
- matrix multiplication
- high level synthesis
- parallel architecture
- distributed memory
- message passing
- data acquisition
- matrix factorization
- hardware implementation
- parallel processing
- shared memory
- signal processing
- computer vision
- distributed systems
- field programmable gate array
- computer science
- three dimensional
- image segmentation
- image processing