AGIBench: A Multi-granularity, Multimodal, Human-Referenced, Auto-Scoring Benchmark for Large Language Models.
Fei Tang, Wanling Gao, Luzhou Peng, Jianfeng Zhan
Published in: Bench (2023)
Keyphrases
- language model
- multi granularity
- language modeling
- document retrieval
- n gram
- speech recognition
- language modelling
- multi user
- probabilistic model
- test collection
- query expansion
- information retrieval
- retrieval model
- human subjects
- statistical language models
- query terms
- location aware
- dynamic integration
- language models for information retrieval
- document ranking
- relevance model
- privacy protection
- multi modal
- smoothing methods
- pseudo relevance feedback
- audio visual