MAGVLT: Masked Generative Vision-and-Language Transformer.
Sungwoong Kim, DaeJin Jo, Donghoon Lee, Jongmin Kim
Published in: CoRR (2023)
Keyphrases
- computer vision
- programming language
- language processing
- generative model
- data driven
- natural language
- vision system
- data sets
- fault diagnosis
- power system
- fuzzy logic
- image processing
- neural network
- object recognition
- e-learning
- information systems
- language learning
- real time
- visual perception
- object-oriented programming
- human vision