What, when, and where? - Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions.

Brian Chen, Nina Shvetsova, Andrew Rouditchenko, Daniel Kondermann, Samuel Thomas, Shih-Fu Chang, Rogério Feris, James R. Glass, Hilde Kuehne
Published in: CoRR (2023)