FM-OV3D: Foundation Model-Based Cross-Modal Knowledge Blending for Open-Vocabulary 3D Detection.
Dongmei Zhang
Chang Li
Renrui Zhang
Shenghao Xie
Wei Xue
Xiaodong Xie
Shanghang Zhang
Published in:
AAAI (2024)
Keyphrases
cross modal
knowledge base
multi modal
multimedia retrieval
visual similarity
perceptual information
low level
multimedia databases