Sign in

TokenFlow: Rethinking Fine-grained Cross-modal Alignment in Vision-Language Retrieval.

Xiaohan ZouChangqiao WuLele ChengZhongyuan Wang
Published in: CoRR (2022)
Keyphrases