MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning.

Published in: ICLR (2024)

Keyphrases