Login / Signup
FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection.
Dongmei Zhang
Chang Li
Ray Zhang
Shenghao Xie
Wei Xue
Xiaodong Xie
Shanghang Zhang
Published in:
CoRR (2023)
Keyphrases
</>
cross modal
multi modal
domain knowledge
knowledge base
object detection
perceptual information
information retrieval
visual data
visual recognition
e learning
co occurrence
visual similarity