Login / Signup

A Review of Multi-Modal Large Language and Vision Models.

Kilian CarolanLaura FennellyAlan F. Smeaton
Published in: CoRR (2024)
Keyphrases
  • multi modal
  • multi modality
  • computer vision
  • cross modal
  • high dimensional
  • fusing multiple
  • image annotation
  • audio visual
  • semantic concepts