Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction.

Published in: CoRR (2024)

Keyphrases