Login / Signup

Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation.

Xun WuShaohan HuangFuru Wei
Published in: CoRR (2024)
Keyphrases