Sign in

Inferflow: an Efficient and Highly Configurable Inference Engine for Large Language Models.

Shuming ShiEnbo ZhaoDeng CaiLeyang CuiXinting HuangHuayang Li
Published in: CoRR (2024)
Keyphrases