FHGN: Frame-Level Heterogeneous Graph Networks for Video Question Answering.
Jinsheng QiFangtao LiTing BaiChenyu CaoWei SongYibo HuBin WuPublished in: ICME (2022)
Keyphrases
- question answering
- video frames
- information extraction
- information retrieval
- natural language processing
- key frames
- question classification
- heterogeneous networks
- question answering systems
- video sequences
- passage retrieval
- structured data
- syntactic information
- relation extraction
- video data
- natural language
- named entities
- cross language
- natural language questions
- multimedia
- qa clef
- artificial intelligence
- open domain question answering
- sentence retrieval
- answering questions
- answer extraction
- qa systems
- semantic roles
- multi domain
- video search
- graph structure
- multi modal
- image retrieval
- expert systems