Multi-Granularity Aggregation Transformer for Joint Video-Audio-Text Representation Learning.
Mengge He
Wenjing Du
Zhiquan Wen
Qing Du
Yutong Xie
Qi Wu
Published in:
IEEE Trans. Circuits Syst. Video Technol. (2023)
Keyphrases
multimedia
multi granularity
learning algorithm
high dimensional
supervised learning
digital libraries
databases
feature extraction
text representation