FM-OV3D: Foundation Model-Based Cross-Modal Knowledge Blending for Open-Vocabulary 3D Detection.

Dongmei Zhang, Chang Li, Renrui Zhang, Shenghao Xie, Wei Xue, Xiaodong Xie, Shanghang Zhang
Published in: AAAI (2024)
Keyphrases
  • cross modal
  • knowledge base
  • multi modal
  • multimedia retrieval
  • visual similarity
  • perceptual information
  • low level
  • multimedia databases