QuTI! Quantifying Text-Image Consistency in Multimodal Documents.
Matthias SpringsteinEric Müller-BudackRalph EwerthPublished in: SIGIR (2021)
Keyphrases
- information retrieval
- input image
- image features
- single image
- image data
- text information
- scanned documents
- printed documents
- image analysis
- multiscale
- text documents
- free text
- web documents
- textual information
- document analysis
- keywords
- image representation
- web images
- image regions
- textual descriptions
- plagiarism detection
- latent semantic analysis
- handwritten documents
- text retrieval
- image content
- high resolution
- image retrieval
- relevant documents
- document processing
- semantic information
- document collections
- image classification
- text content
- similarity measure
- image segmentation
- text queries
- document categorization
- natural language text
- key concepts
- text data
- semantic similarity
- text mining
- search engine