Unsupervised Improvement of Audio-Text Cross-Modal Representations.

Published in: CoRR (2023)

Keyphrases