V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by Connecting Foundation Models.

Heng Wang, Jianbo Ma, Santiago Pascual, Richard Cartwright, Weidong Cai
Published in: CoRR (2023)
Keyphrases
  • lightweight
  • probabilistic model
  • image processing
  • multimedia
  • handheld devices
  • computer vision
  • optimal solution
  • vision system
  • model selection