Multimodal Token Fusion for Vision Transformers.
Yikai WangXinghao ChenLele CaoWenbing HuangFuchun SunYunhe WangPublished in: CoRR (2022)
Keyphrases
- multimodal fusion
- multimodal interfaces
- vision system
- multimodal biometrics
- data fusion
- computer vision
- fusion method
- multi sensor
- multimodal interaction
- audio visual
- information fusion
- real time
- brain image analysis
- image fusion
- multi modal
- fusion algorithm
- active vision
- multimedia
- multimodal data
- computational vision
- case study
- multi modality
- high robustness
- artificial intelligence
- learning algorithm
- neural network
- data sets