CRIPP-VQA: Counterfactual Reasoning about Implicit Physical Properties via Video Question Answering.
Maitreya Patel, Tejas Gokhale, Chitta Baral, Yezhou Yang
Published in: EMNLP (2022)
Keyphrases
- question answering
- video database
- information retrieval
- information extraction
- video quality assessment
- question classification
- multimedia
- natural language processing
- named entities
- syntactic information
- natural language
- question answering systems
- passage retrieval
- cross language
- open domain question answering
- video data
- answer extraction
- qa clef
- sentence retrieval
- video content
- video frames
- image database
- relation extraction
- video clips
- video retrieval
- video streams
- answering questions
- key frames
- information retrieval systems
- video sequences
- artificial intelligence