Balancing task- and data-level parallelism to improve performance and energy consumption of matrix computations on the Intel Xeon Phi.
Manuel F. Dolz, Francisco D. Igual, Thomas Ludwig, Luis Piñuel, Enrique S. Quintana-Ortí
Published in: Comput. Electr. Eng. (2015)