C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems.
Xupeng Miao
Gabriele Oliaro
Zhihao Zhang
Xinhao Cheng
Hongyi Jin
Tianqi Chen
Zhihao Jia
Published in:
CoRR (2023)
Keyphrases
</>
language model
document retrieval
information retrieval
speech recognition
language modeling
query expansion
n gram
language modelling
retrieval model
pseudo relevance feedback
ir models
smoothing methods