Question-Aware Global-Local Video Understanding Network for Audio-Visual Question Answering.
Zailong Chen, Lei Wang, Peng Wang, Peng Gao
Published in: IEEE Trans. Circuits Syst. Video Technol. (2024)
Keyphrases
- question answering
- audio visual
- passage retrieval
- question classification
- question answering systems
- visual data
- answering questions
- multimedia
- qa clef
- natural language questions
- multi modal
- qa systems
- answer validation
- answer extraction
- visual information
- candidate answers
- information retrieval
- information extraction
- natural language
- question answer pairs
- document retrieval
- natural language processing
- named entities
- video sequences
- space time
- video data
- video search
- artificial intelligence
- video frames
- text mining