OmniVL: One Foundation Model for Image-Language and Video-Language Tasks.
Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Luowei Zhou, Yucheng Zhao, Yujia Xie, Ce Liu, Yu-Gang Jiang, Lu Yuan
Published in: CoRR (2022)
Keyphrases
- statistical model
- image data
- language learning
- similarity measure
- specification language
- programming language
- image analysis
- image retrieval
- input image
- reconstruction method
- image classification
- conceptual model
- high resolution
- low level
- probabilistic model
- natural language
- single image
- multiscale
- pixel values
- bayesian framework
- edge detection
- image content
- computational model
- super resolution
- computer vision
- multimedia