Object-based Appearance-Motion Heterogeneous Network for Video Question Answering.
Feifei XuZheng ZhongYitao ZhuGuangzhen LiYingchen ZhouWang ZhouPublished in: ICPADS (2023)
Keyphrases
- question answering
- heterogeneous networks
- dynamic textures
- key frames
- visual data
- video data
- question classification
- image sequences
- multimedia
- information extraction
- passage retrieval
- qa clef
- natural language
- video sequences
- natural language processing
- information retrieval
- video content
- question answering systems
- human motion
- video frames
- natural language questions
- syntactic information
- candidate answers
- answering questions
- video retrieval
- multimedia data
- video shots
- multiple types
- document retrieval
- text mining
- probabilistic model