MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration.

Published in: ECCV (8) (2022)

Keyphrases