End-to-End Modeling via Information Tree for One-Shot Natural Language Spatial Video Grounding.

Mengze Li, Tianbao Wang, Haoyu Zhang, Shengyu Zhang, Zhou Zhao, Jiaxu Miao, Wenqiao Zhang, Wenming Tan, Jin Wang, Peng Wang, Shiliang Pu, Fei Wu
Published in: CoRR (2022)
Keyphrases
  • end to end
  • natural language
  • spatial information
  • real time
  • end users
  • video streams
  • multimedia
  • multimedia data
  • admission control
  • scalable video