Understanding Attention for Vision-and-Language Tasks.
Feiqi CaoSoyeon Caren HanSiqu LongChangwei XuJosiah PoonPublished in: CoRR (2022)
Keyphrases
- computer vision
- real time
- visually guided
- programming language
- vision system
- database
- multiple tasks
- website
- databases
- computational model
- language understanding
- specification language
- deeper understanding
- conceptual graphs
- multi task
- visual attention
- language learning
- search engine
- general purpose
- object oriented
- multi agent systems
- image processing
- information systems
- machine learning