Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training.

Published in: CoRR (2021)

Keyphrases