AGIBench: A Multi-granularity, Multimodal, Human-referenced, Auto-scoring Benchmark for Large Language Models.
Fei TangWanling GaoLuzhou PengJianfeng ZhanPublished in: CoRR (2023)
Keyphrases
- language model
- multi granularity
- language modeling
- n gram
- probabilistic model
- document retrieval
- language modelling
- information retrieval
- statistical language models
- retrieval model
- multi user
- dynamic integration
- test collection
- speech recognition
- query expansion
- smoothing methods
- human subjects
- privacy protection
- document ranking
- multi modal
- relevance model
- location aware
- language models for information retrieval
- spoken term detection
- query terms
- pseudo relevance feedback