RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training.
Zheng Yuan, Qiao Jin, Chuanqi Tan, Zhengyun Zhao, Hongyi Yuan, Fei Huang, Songfang Huang
Published in: ACM Multimedia (2023)
Keyphrases
- multi modal
- question answering
- cross modal
- passage retrieval
- video search
- information retrieval
- information extraction
- audio visual
- natural language processing
- named entities
- answer extraction
- document retrieval
- cross language
- question answering systems
- natural language
- sentence retrieval
- single modality
- high dimensional
- visual information
- relation extraction
- natural language questions
- syntactic information
- text mining
- training set
- speech transcripts
- retrieval systems
- information retrieval systems
- relevance feedback
- low level
- image retrieval
- high level
- retrieval model
- test collection
- test set
- visual features
- language model