ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning.
Xiao XuBei LiChenfei WuShao-Yen TsengAnahita BhiwandiwallaShachar RosenmanVasudev LalWanxiang CheNan DuanPublished in: ACL (1) (2023)
Keyphrases
- learning process
- language acquisition
- learning systems
- learning algorithm
- online learning
- active learning
- supervised learning
- computer vision
- mobile learning
- learning tasks
- inductive inference
- data mining
- prior knowledge
- domain knowledge
- knowledge representation
- programming language
- natural language
- multi modal
- language learning
- object oriented programming