Tools and techniques for performance - Exploiting the performance of 32 bit floating point arithmetic in obtaining 64 bit accuracy (revisiting iterative refinement for linear systems).
Julie LangouJulien LangouPiotr LuszczekJakub KurzakAlfredo ButtariJack J. DongarraPublished in: SC (2006)