Prompting Large Language Models with Fine-Grained Visual Relations from Scene Graph for Visual Question Answering.
Jiapeng LiuChengyang FangLiang LiBing LiDayong HuCan MaPublished in: ICASSP (2024)
Keyphrases
- fine grained
- question answering
- language model
- passage retrieval
- information retrieval
- visual information
- sentence retrieval
- language modeling
- information extraction
- access control
- natural language processing
- n gram
- visual data
- natural language
- question classification
- query expansion
- structured data
- speech recognition
- probabilistic model
- document retrieval
- xml documents
- video search
- metadata