An Empirical Study of Multilingual Scene-Text Visual Question Answering.
Lin LiHaohan ZhangZeqin FangPublished in: NarSUM@MM (2023)
Keyphrases
- question answering
- cross language
- information retrieval
- natural language processing
- scene text
- named entities
- natural language
- visual information
- information extraction
- question answering systems
- text detection
- digital libraries
- visual features
- high level
- object recognition
- qa systems
- natural scene images
- search engine
- relevant documents