Login / Signup

VLRM: Vision-Language Models act as Reward Models for Image Captioning.

Maksim DzabraevAlexander KunitsynAndrei Ivaniuta
Published in: CoRR (2024)
Keyphrases