Login / Signup
Enhance audio generation controllability through representation similarity regularization.
Yangyang Shi
Gaël Le Lan
Varun Nagaraja
Zhaoheng Ni
Xinhao Mei
Ernie Chang
Forrest N. Iandola
Yang Liu
Vikas Chandra
Published in:
CoRR (2023)
Keyphrases
</>
similarity measure
multimedia
distance metric
visual information
euclidean distance
spatial location
cross modal