Login / Signup

L-STAP: Learned Spatio-Temporal Adaptive Pooling for Video Captioning.

Danny FrancisBenoit Huet
Published in: AI4TV@MM (2019)
Keyphrases