M3TTS: Multi-modal text-to-speech of multi-scale style control for dubbing.

Published in: Pattern Recognit. Lett. (2024)

Keyphrases