Align before Fuse: Vision and Language Representation Learning with Momentum Distillation.
Junnan LiRamprasaath R. SelvarajuAkhilesh GotmareShafiq R. JotyCaiming XiongSteven Chu-Hong HoiPublished in: NeurIPS (2021)
Keyphrases
- learning process
- learning tasks
- vision system
- learning algorithm
- learning problems
- learning systems
- reinforcement learning
- computer vision
- learning scheme
- knowledge acquisition
- image processing
- real time
- positive examples
- dynamic bayesian networks
- language processing
- object oriented programming
- computer programming
- elementary school
- representation language
- structured representation