Learning a Contextualized Multimodal Embedding for Zero-shot Cooking Video Caption Generation.

Lin Wang, Hongyi Zhang, Xingfu Wang, Yan Xiong
Published in: MMAsia (2023)
Keyphrases