VILA: Learning Image Aesthetics from User Comments with Vision-Language Pretraining.

Published in: CVPR (2023)

Keyphrases