Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-Modal Structured Representations.

Published in: AAAI (2024)

Keyphrases