EnCLAP: Combining Neural Audio Codec and Audio-Text Joint Embedding for Automated Audio Captioning.

Jaeyeon Kim, Jaeyoon Jung, Jinjoo Lee, Sang Hoon Woo
Published in: ICASSP (2024)