Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning.

Published in: CoRR (2021)
