StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models.
Zhicheng Guo
Sijie Cheng
Hao Wang
Shihao Liang
Yujia Qin
Peng Li
Zhiyuan Liu
Maosong Sun
Yang Liu
Published in:
ACL (Findings) (2024)
Keyphrases
language model
language modeling
probabilistic model
n gram
query terms
information retrieval
speech recognition
test collection
context sensitive
active learning
text classification
query expansion
statistical language models