End-to-End Modeling via Information Tree for One-Shot Natural Language Spatial Video Grounding.

Published in: CoRR (2022)

Keyphrases