Heterogeneous Interactive Graph Network for Audio-Visual Question Answering.
Yihan Zhao, Wei Xi, Gairui Bai, Xinhui Liu, Jizhong Zhao
Published in: Knowl. Based Syst. (2024)
Keyphrases
- question answering
- audio visual
- passage retrieval
- multi modal
- information retrieval
- natural language
- information extraction
- natural language processing
- natural language questions
- visual data
- document retrieval
- visual information
- named entities
- question answering systems
- language model
- multimedia
- nearest neighbor
- artificial intelligence
- machine learning
- data mining