Unsupervised Improvement of Audio-Text Cross-Modal Representations.

Published in: WASPAA (2023)

Keyphrases