MAGVLT: Masked Generative Vision-and-Language Transformer.
Sungwoong KimDaejin JoDonghoon LeeJongmin KimPublished in: CVPR (2023)
Keyphrases
- programming language
- language learning
- computer vision
- generative model
- natural language
- english language
- specification language
- vision system
- high voltage
- computational vision
- language processing
- modeling language
- image processing
- real time
- data driven
- power system
- general purpose
- software engineering
- probability distribution
- visual perception
- high level
- operational semantics
- visual field
- database