InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition.
Pan Zhang, Xiaoyi Dong, Bin Wang, Yuhang Cao, Chao Xu, Linke Ouyang, Zhiyuan Zhao, Shuangrui Ding, Songyang Zhang, Haodong Duan, Wenwei Zhang, Hang Yan, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang
Published in: CoRR (2023)
Keyphrases
- statistical model
- image analysis
- high level
- low level
- vision system
- single image
- image features
- edge detection
- random fields
- computational model
- multiscale
- image content
- segmentation method
- prior model
- specification language
- computational linguistics
- cognitive processes
- image collections
- probability distribution
- image retrieval
- computer vision
- image classification
- high resolution
- image data