TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition.

Published in: INTERSPEECH (2023)

Keyphrases