Scene text visual question answering by using YOLO and STN.
Kimiya NouraliElham DolkhaniPublished in: Int. J. Speech Technol. (2024)
Keyphrases
- question answering
- information retrieval
- visual information
- named entities
- information extraction
- natural language
- natural language processing
- scene text
- low level
- text detection
- qa clef
- natural language questions
- high level
- question answering systems
- knowledge representation
- natural scene images
- object recognition
- visual features
- machine learning
- qa systems