Publication: Pre-Training Multi-Modal Dense Retrievers for Outside-Knowledge Visual Question Answering.