A Multi-Modal Multilingual Benchmark for Document Image Classification.
Yoshinari FujinumaSiddharth VariaNishant SankaranSrikar AppalarajuBonan MinYogarshi VyasPublished in: CoRR (2023)
Keyphrases
- multi modal
- image classification
- multilingual information retrieval
- feature extraction
- bag of words
- image representation
- document collections
- multi modality
- digital libraries
- information retrieval
- semantic concepts
- visual features
- visual recognition
- retrieval systems
- information retrieval systems
- machine learning
- multi label
- visual words
- audio visual
- cross modal
- keywords
- image annotation
- video search
- image features
- high dimensional
- text documents
- document retrieval
- video retrieval
- cross lingual
- object recognition
- semantic information
- multimedia retrieval