OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion.
Hao Wang, Pengzhen Ren, Zequn Jie, Xiao Dong, Chengjian Feng, Yinlong Qian, Lin Ma, Dongmei Jiang, Yaowei Wang, Xiangyuan Lan, Xiaodan Liang
Published in: CoRR (2024)
Keyphrases
- automatic detection
- detection method
- object detection
- information fusion
- natural language
- multi-modal fusion
- detection accuracy
- programming language
- false alarms
- false positives
- language learning
- image processing
- detection rate
- face detection
- combining multiple
- data sets
- medical images
- knowledge representation
- xml documents
- metadata