Multi-Scale Attention for Audio Question Answering.
Guangyao Li, Yixin Xu, Di Hu
Published in: CoRR (2023)
Keyphrases
- question answering
- multiscale
- information extraction
- natural language processing
- natural language questions
- natural language
- information retrieval
- cross language
- passage retrieval
- question classification
- multimedia
- named entities
- question answering systems
- qa clef
- open domain question answering
- visual data
- visual information
- audio visual
- syntactic information
- answer extraction
- semantic roles
- relation extraction
- answer validation
- textual entailment recognition
- knowledge base