ERNIE-ViL: Knowledge Enhanced Vision-Language Representations through Scene Graphs.
Fei YuJiji TangWeichong YinYu SunHao TianHua WuHaifeng WangPublished in: AAAI (2021)
Keyphrases
- real time
- knowledge representation
- natural language
- computer vision
- linguistic knowledge
- d scene
- programming language
- domain knowledge
- video sequences
- knowledge base
- image processing
- conceptual representation
- prior knowledge
- single image
- knowledge management
- image sequences
- vision system
- directed graph
- multiple images
- outdoor scenes
- representation language
- knowledge discovery
- visual scene
- formal languages
- expert systems