Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline.
Zangwei ZhengXiaozhe RenFuzhao XueYang LuoXin JiangYang YouPublished in: NeurIPS (2023)
Keyphrases
- fixed length
- scheduling algorithm
- scheduling problem
- round robin
- bayesian model
- knowledge base
- resource allocation
- probabilistic inference
- variable length
- manufacturing cell
- genetic algorithm
- edit operations
- longest common subsequence
- flexible manufacturing systems
- grammatical inference
- visual perception
- machine intelligence
- resource constraints
- belief networks
- bayesian networks