Tell Me the Evidence? Dual Visual-Linguistic Interaction for Answer Grounding.
Junwen PanGuanlin ChenYi LiuJiexiang WangCheng BianPengfei ZhuZhicheng ZhangPublished in: CoRR (2022)
Keyphrases
- human computer interaction
- user interaction
- visual features
- natural language processing
- visual feedback
- visual perception
- visual information
- natural language
- higher level
- visual data
- human interaction
- machine learning
- low level
- lower bound
- high level
- visual cues
- computer vision
- human vision
- linguistic knowledge
- linguistic features
- evidential reasoning