Boosting Audio Visual Question Answering via Key Semantic-Aware Cues.
Guangyao Li, Henghui Du, Di Hu
Published in: CoRR (2024)
Keyphrases
- audio visual
- question answering
- passage retrieval
- multi modal
- question answering systems
- natural language processing
- natural language
- visual data
- information retrieval
- information extraction
- visual information
- named entities
- document retrieval
- artificial intelligence
- multimedia
- natural language questions
- image annotation
- high dimensional data
- query expansion
- image data
- data sources
- high dimensional