Login / Signup
BUS: Efficient and Effective Vision-language Pre-training with Bottom-Up Patch Summarization.
Chaoya Jiang
Haiyang Xu
Wei Ye
Qinghao Ye
Chenliang Li
Ming Yan
Bin Bi
Shikun Zhang
Fei Huang
Songfang Huang
Published in:
CoRR (2023)
Keyphrases
</>
cost effective
computationally efficient
programming language
data driven
highly efficient
real time
computer vision
high speed
computationally expensive
natural language
feature space
supervised learning
training examples
visual attention
training process
training phase