Scaling Down to Scale Up: A Cost-Benefit Analysis of Replacing OpenAI's LLM with Open Source SLMs in Production.
Chandra IrugalbandaraAshish MahendraRoland DaynauthTharuka Kasthuri ArachchigeJayanaka DantanarayanaKrisztián FlautnerLingjia TangYiping KangJason MarsPublished in: ISPASS (2024)