Masked Vision and Language Pre-training with Unimodal and Multimodal Contrastive Losses for Medical Visual Question Answering.
Pengfei Li, Gang Liu, Jinlong He, Zixu Zhao, Shenjun Zhong
Published in: CoRR (2023)
Keyphrases
- question answering
- natural language
- natural language processing
- named entities
- cross language
- question classification
- information retrieval
- qa clef
- passage retrieval
- information extraction
- natural language questions
- visual information
- syntactic information
- visual features
- sentence retrieval
- question answering systems
- artificial intelligence
- relation extraction
- test set
- multi modal
- speech transcripts
- semantic roles
- qa systems
- candidate answers
- automatically generated
- text mining
- open domain question answering