Hierarchical Multimodal Transformer to Summarize Videos.
Bin Zhao
Maoguo Gong
Xuelong Li
Published in:
CoRR (2021)
Keyphrases
multi modal
fuzzy logic
video sequences
video data
video frames
fault diagnosis
key frames
video content
video database
youtube videos
neural network
multimodal data
multimodal interaction
hierarchical model
video search
multi party
multimedia