Cross-Modal Fine-Tuning: Align then Refine.

Junhong Shen, Liam Li, Lucio M. Dery, Corey Staten, Mikhail Khodak, Graham Neubig, Ameet Talwalkar
Published in: CoRR (2023)
Keyphrases
  • fine tuning
  • cross modal
  • multi modal
  • image retrieval
  • fine tuned
  • multimedia databases
  • multimedia retrieval
  • visual data
  • visual recognition
  • perceptual information
  • visual similarity
  • image sequences