Hierarchical Conditional Relation Networks for Multimodal Video Question Answering.
Thao Minh LeVuong LeSvetha VenkateshTruyen TranPublished in: Int. J. Comput. Vis. (2021)
Keyphrases
- question answering
- multimedia
- question classification
- cross language
- information retrieval
- natural language processing
- natural language
- video data
- video sequences
- information extraction
- question answering systems
- video content
- natural language questions
- passage retrieval
- answer validation
- qa clef
- named entities
- relation extraction
- multi modal
- sentence retrieval
- video frames
- syntactic information
- qa systems
- video shots
- candidate answers
- expert systems
- search engine