MM-PhyQA: Multimodal Physics Question-Answering With Multi-Image CoT Prompting.
Avinash Anand, Janak Kapuriya, Apoorv Singh, Jay Saraf, Naman Lal, Astha Verma, Rushali Gupta, Rajiv Ratn Shah
Published in: CoRR (2024)
Keyphrases
- question answering
- image classification
- natural language
- information retrieval
- image features
- natural language processing
- image representation
- information extraction
- named entities
- image content
- qa clef
- low level
- question classification
- open domain question answering
- image retrieval
- relation extraction
- question answering systems
- natural language questions
- answer validation
- automatically generated
- passage retrieval