Multimedia analysis of robustly optimized multimodal transformer based on vision and language co-learning.
Junho Yoon, Gyu Ho Choi, Chang Choi
Published in: Inf. Fusion (2023)
Keyphrases
- multimedia
- learning process
- real time
- language acquisition
- object oriented programming
- vision system
- data analysis
- learning environment
- statistical analysis
- learning algorithm
- machine learning
- digital libraries
- image analysis
- computer vision
- prior knowledge
- supervised learning
- artificial intelligence
- online learning
- multi modal
- unsupervised learning
- mobile learning