Login / Signup
Word2Pix: Word to Pixel Cross-Attention Transformer in Visual Grounding.
Heng Zhao
Joey Tianyi Zhou
Yew-Soon Ong
Published in:
IEEE Trans. Neural Networks Learn. Syst. (2024)
Keyphrases
</>
co occurrence
word recognition
related words
keywords
n gram
neural network
natural language text
input image
visual features
visual information
word sense disambiguation
text corpus
selective attention
stop words
word meaning