A Multimodal Aggregation Network With Serial Self-Attention Mechanism for Micro-Video Multi-Label Classification.
Wei Lu, Jiaxin Lin, Peiguang Jing, Yuting Su
Published in: IEEE Signal Process. Lett. (2023)
Keyphrases
- multi label classification
- multi label
- attention mechanism
- video sequences
- multimedia
- image annotation
- multi modal
- video retrieval
- classification algorithm
- semi supervised learning
- binary classification
- image classification
- machine learning
- text categorization
- video data
- markov random field
- visual attention
- semi supervised
- object recognition
- face recognition
- meta level
- decision trees
- computer vision