Question Aware Vision Transformer for Multimodal Reasoning.
Roy GanzYair KittenplonAviad AberdamElad Ben-AvrahamOren NurielShai MazorRon LitmanPublished in: CoRR (2024)
Keyphrases
- knowledge base
- reasoning process
- computer vision
- reasoning systems
- image processing
- qualitative reasoning
- multimodal interfaces
- knowledge representation
- vision system
- formal models
- fault diagnosis
- multi modal
- automated reasoning
- fuzzy logic
- answering questions
- deductive reasoning
- analogical reasoning
- multimodal interaction
- default reasoning
- knowledge representation and reasoning
- computational properties
- computational vision
- reasoning tasks
- visual perception
- spatial reasoning
- real time
- production rules
- power system
- human computer interaction
- control system
- multimedia
- neural network
- data sets