Login / Signup

Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models.

Christian SchlarmannNaman Deep SinghFrancesco CroceMatthias Hein
Published in: CoRR (2024)
Keyphrases
  • language model
  • fine tuning
  • information retrieval
  • speech recognition
  • document retrieval
  • context sensitive
  • web search
  • language modelling