Gemini vs GPT-4V: A Preliminary Comparison and Combination of Vision-Language Models Through Qualitative Cases.

Zhangyang QiYe FangMengchen ZhangZeyi SunTong WuZiwei LiuDahua LinJiaqi WangHengshuang Zhao
Published in: CoRR (2023)
Keyphrases