Login / Signup

Understanding Cross-modal Interactions in V&L Models that Generate Scene Descriptions.

Michele CafagnaKees van DeemterAlbert Gatt
Published in: CoRR (2022)
Keyphrases
  • cross modal
  • information retrieval
  • video sequences
  • multi modal
  • visual data
  • feature selection
  • three dimensional