COCA: COllaborative CAusal Regularization for Audio-Visual Question Answering.
Mingrui LaoNan PuYu LiuKai HeErwin M. BakkerMichael S. LewPublished in: AAAI (2023)
Keyphrases
- audio visual
- question answering
- passage retrieval
- multi modal
- visual information
- information retrieval
- natural language processing
- multimedia
- named entities
- information extraction
- natural language
- natural language questions
- visual data
- question answering systems
- document collections
- qa systems
- answer extraction