Probing Inter-modality: Visual Parsing with Self-Attention for Vision-and-Language Pre-training.

Published in: NeurIPS (2021)

Keyphrases