AAST-NLP at Multimodal Hate Speech Event Detection 2024 : A Multimodal Approach for Classification of Text-Embedded Images Based on CLIP and BERT-Based Models.
Ahmed El-SayedOmar NasrPublished in: CASE (2024)
Keyphrases
- event detection
- image classification
- video segments
- multiple modalities
- multi modal
- input image
- web images
- audio visual
- video surveillance
- text mining
- machine learning
- video event detection
- object recognition
- image retrieval
- feature space
- information retrieval
- keywords
- image annotation
- machine learning algorithms
- information extraction
- image features
- text classification
- decision trees
- event recognition
- sports video
- video analysis
- image collections
- probabilistic model
- natural language processing
- feature vectors
- low level
- natural language
- activity recognition
- feature extraction
- active learning
- knn