PyTorch RPC: Distributed Deep Learning Built on Tensor-Optimized Remote Procedure Calls.
Shen LiPritam DamaniaLuca WehrstedtRohan VarmaOmkar SalpekarPavel BelevichHoward HuangYanli ZhaoLucas HosseiniWanchao LiangHongyi JiaShihao XuSatendra GeraAlisson G. AzzoliniGuoqiang Jerry ChenZachary DeVitoChaoyang HeAmir ZiashahabiAlban DesmaisonEdward Z. YangGregory ChananBrian VaughanManoj KrishnanJoseph S. SpisakSalman AvestimehrSoumith ChintalaPublished in: MLSys (2023)