Towards Efficient Cross-Modal Visual Textual Retrieval using Transformer-Encoder Deep Features.

Published in: CoRR (2021)

Keyphrases