Violet: A Vision-Language Model for Arabic Image Captioning with Gemini Decoder.

Published in: CoRR (2023)

Keyphrases