MulCogBench: A Multi-modal Cognitive Benchmark Dataset for Evaluating Chinese and English Computational Language Models.
Yunhao ZhangXiaohan ZhangChong LiShaonan WangChengqing ZongPublished in: CoRR (2024)
Keyphrases
- multi modal
- language model
- benchmark datasets
- language modeling
- cross language retrieval
- probabilistic model
- speech recognition
- information retrieval
- n gram
- retrieval model
- statistical machine translation
- document retrieval
- statistical language models
- language modelling
- cross lingual
- multi modality
- query expansion
- chinese english
- audio visual
- foreign language
- word segmentation
- translation model
- test collection
- natural language
- document ranking
- smoothing methods
- video search
- high dimensional
- out of vocabulary
- relevance model
- cross language
- language models for information retrieval