How Should We Extract Discrete Audio Tokens from Self-Supervised Models?

Published in: CoRR (2024)

Keyphrases