Login / Signup
ConVQG: Contrastive Visual Question Generation with Multimodal Guidance.
Li Mi
Syrielle Montariol
Javiera Castillo-Navarro
Xianjie Dai
Antoine Bosselut
Devis Tuia
Published in:
CoRR (2024)
Keyphrases
</>
visual information
multimodal information
cross modal
multi modal
visual features
visual search
machine learning
website
low level
audio visual
visual perception
generation process
correct answers
answer questions
multimodal interaction