Less is More: Removing Text-regions Improves CLIP Training Efficiency and Robustness.

Published in: CoRR (2023)

Keyphrases