CTRL-F: Pairing Convolution with Transformer for Image Classification via Multi-Level Feature Cross-Attention and Representation Learning Fusion.
Hosam S. El-AssioutiHadeer El-SaadawyMaryam N. Al-BerryMohamed F. TolbaPublished in: CoRR (2024)
Keyphrases
- image classification
- learning algorithm
- image representation
- learning systems
- reinforcement learning
- active learning
- spatial pyramid
- machine learning
- multiple representations
- visual recognition
- image features
- prior knowledge
- learning process
- object recognition
- fuzzy logic
- online learning
- mobile robot
- feature representation
- feature extraction