Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation.
Lei WangLingxiao MaShijie CaoQuanlu ZhangJilong XueYining ShiNingxin ZhengZiming MiaoFan YangTing CaoYuqing YangMao YangPublished in: OSDI (2024)