Enhancing multi-modal fusion in visual dialog via sample debiasing and feature interaction.
Chenyu Lu, Jun Yin, Hao Yang, Shiliang Sun
Published in: Inf. Fusion (2024)
Keyphrases
- multi modal fusion
- human computer interaction
- natural language
- image features
- user interaction
- human interaction
- feature vectors
- image classification
- multimodal interaction
- visual information
- visual perception
- image sequences
- orientation selective
- dialog systems
- facial features
- sample size
- neural network
- feature set
- user interface