Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform.
Masaya KawamuraYuma ShirahataRyuichi YamamotoKentaro TachibanaPublished in: ICASSP (2023)
Keyphrases
- lightweight
- end to end
- high fidelity
- multi band
- text to speech
- short time fourier transform
- multispectral
- real time
- high quality
- word processing
- ad hoc networks
- frequency band
- multipath
- hyperspectral
- video conferencing
- wireless sensor networks
- high resolution
- multiscale
- visual information
- coding scheme
- remote sensing
- fast fourier transform