Attention-Based Multimodal Deep Learning on Vision-Language Data: Models, Datasets, Tasks, Evaluation Metrics and Applications.
Priyankar Bose, Pratip Rana, Preetam Ghosh
Published in: IEEE Access (2023)
Keyphrases
- evaluation metrics
- deep learning
- data model
- restricted Boltzmann machine
- precision and recall
- unsupervised learning
- unsupervised feature learning
- evaluation measures
- machine learning
- database systems
- computer vision
- mental models
- learning to rank
- similarity measure
- object model
- co-occurrence
- benchmark datasets
- natural language
- learning algorithm
- data sets