Login / Signup

Is multi-modal vision supervision beneficial to language?

Avinash MadasuVasudev Lal
Published in: CoRR (2023)
Keyphrases
  • multi modal
  • computer vision
  • multi modality
  • image annotation
  • audio visual
  • cross modal
  • fusing multiple