Login / Signup
Weakly Supervised Dense Video Captioning via Jointly Usage of Knowledge Distillation and Cross-modal Matching.
Bofeng Wu
Guocheng Niu
Jun Yu
Xinyan Xiao
Jian Zhang
Hua Wu
Published in:
CoRR (2021)
Keyphrases
</>
weakly supervised
cross modal
multi modal
topic models
visual data
knowledge base
multimedia
object class
video sequences
keypoints
semi supervised
input image
video data
superpixels