U-DiT TTS: U-Diffusion Vision Transformer for Text-to-Speech.

Xin Jing, Yi Chang, Zijiang Yang, Jiangjian Xie, Andreas Triantafyllopoulos, Björn W. Schuller

Published in: CoRR (2023)

Keyphrases

text to speech
speech synthesis
computer vision
vision system
anisotropic diffusion
prosodic features
word processing
programming tool
real time
fuzzy logic
image processing
text to speech synthesis
genetic algorithm
writing skills
english text
diffusion process
fault diagnosis
diffusion model
diffusion processes
nonlinear diffusion
information diffusion
multi modal
online learning