Multi: Multimodal Understanding Leaderboard with Text and Images.
Zichen Zhu, Yang Xu, Lu Chen, Jingkai Yang, Yichuan Ma, Yiming Sun, Hailin Wen, Jiaqi Liu, Jinyu Cai, Yingzi Ma, Situo Zhang, Zihan Zhao, Liangtai Sun, Kai Yu
Published in: CoRR (2024)
Keyphrases
- image data
- input image
- image database
- three dimensional
- ground truth
- web images
- image features
- textual information
- image retrieval
- image analysis
- image collections
- image classification
- text extraction
- feature points
- edge detection
- natural language processing
- test images
- lighting conditions
- image registration
- object recognition
- information retrieval
- image quality
- text classification
- natural images
- image set
- face recognition
- text regions