Login / Signup
End-to-End Modeling via Information Tree for One-Shot Natural Language Spatial Video Grounding.
Mengze Li
Tianbao Wang
Haoyu Zhang
Shengyu Zhang
Zhou Zhao
Jiaxu Miao
Wenqiao Zhang
Wenming Tan
Jin Wang
Peng Wang
Shiliang Pu
Fei Wu
Published in:
CoRR (2022)
Keyphrases
</>
end to end
natural language
spatial information
real time
end users
video streams
multimedia
multimedia data
admission control
scalable video