Pushing the Limits of Narrow Precision Inferencing at Cloud Scale with Microsoft Floating Point.
Bita Darvish RouhaniDaniel LoRitchie ZhaoMing LiuJeremy FowersKalin OvtcharovAnna VinogradskySarah MassengillLita YangRay BittnerAlessandro ForinHaishan ZhuTaesik NaPrerak PatelShuai CheLok Chand KoppakaXia SongSubhojit SomKaustav DasSaurabh T.Steven K. ReinhardtSitaram LankaEric S. ChungDoug BurgerPublished in: NeurIPS (2020)