FlightLLM: Efficient Large Language Model Inference with a Complete Mapping Flow on FPGAs.
Shulin Zeng, Jun Liu, Guohao Dai, Xinhao Yang, Tianyu Fu, Hongyi Wang, Wenheng Ma, Hanbo Sun, Shiyao Li, Zixiao Huang, Yadong Dai, Jintao Li, Zehao Wang, Ruoyu Zhang, Kairui Wen, Xuefei Ning, Yu Wang
Published in: FPGA (2024)