Augmenting Vision Language Pretraining by Learning Codebook with Visual Semantics.
Xiaoyuan GuoJiali DuanC.-C. Jay KuoJudy Wawira GichoyaImon BanerjeePublished in: CoRR (2022)
Keyphrases
- learning process
- language acquisition
- image processing
- operational semantics
- vision system
- learning algorithm
- visual processing
- online learning
- programming language
- language learning
- computer vision
- machine learning
- logical language
- visual learning
- real time
- object oriented programming
- visual perception
- learning tasks
- visual information
- bag of words
- vector quantization
- learning systems
- supervised learning
- feature vectors