A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene.
Wenbo ZhangYifan ZhangJianfeng LinBinqiang HuangJinlu ZhangWenhao YuPublished in: CoRR (2024)
Keyphrases
- representation language
- image processing
- conceptual model
- expert systems
- prior knowledge
- computer vision
- conceptual framework
- single image
- knowledge acquisition
- knowledge management
- digital libraries
- real time
- input image
- object detection
- probabilistic model
- data mining techniques
- domain knowledge
- data model
- d scene
- metadata
- context dependent