Reformulating Vision-Language Foundation Models and Datasets Towards Universal Multimodal Assistants.

Published in: CoRR (2023)

Keyphrases