Login / Signup

Beyond Image-Text Matching: Verb Understanding in Multimodal Transformers Using Guided Masking.

Ivana BenováJana KoseckáMichal GregorMartin TamajkaMarcel VeselýMarián Simko
Published in: CoRR (2024)
Keyphrases