A focus fusion attention mechanism integrated with image captions for knowledge graph-based visual question answering.

Published in: Signal Image Video Process. (2024)

Keyphrases