Login / Signup

Unsupervised Audio-Caption Aligning Learns Correspondences Between Individual Sound Events and Textual Phrases.

Huang XieOkko RäsänenKonstantinos DrossosTuomas Virtanen
Published in: ICASSP (2022)
Keyphrases