Login / Signup

MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and Generation.

Ruibo FuShuchen ShiHongming GuoTao WangChunyu QiangZhengqi WenJianhua TaoXin QiYi LuXiaopeng WangZhiyong WangYukun LiuXuefei LiuShuai ZhangGuanjun Li
Published in: CoRR (2024)
Keyphrases