Login / Signup
TRIPS: Efficient Vision-and-Language Pre-training with Text-Relevant Image Patch Selection.
Chaoya Jiang
Haiyang Xu
Chenliang Li
Ming Yan
Wei Ye
Shikun Zhang
Bin Bi
Songfang Huang
Published in:
EMNLP (2022)
Keyphrases
</>
image patches
computer vision
natural images
language generation
pattern recognition
high level
training samples
information retrieval
image segmentation
image sequences
feature extraction
multiscale
decision trees
high dimensional data
image processing
computational linguistics
english text