Face-StyleSpeech: Improved Face-to-Voice latent mapping for Natural Zero-shot Speech Synthesis from a Face Image.
Minki KangWooseok HanEunho YangPublished in: CoRR (2023)
Keyphrases
- face images
- human faces
- face recognition
- facial expressions
- speech synthesis
- facial features
- face databases
- face matching
- face verification
- face model
- face recognition systems
- face space
- principal component analysis
- illumination variations
- gender recognition
- facial images
- low resolution
- face recognition algorithms
- automatic face
- high resolution
- feature extraction
- feature vectors
- probe image
- age estimation
- face representation
- training set
- frontal view
- text to speech
- feature points
- pose variations
- face identification
- facial expression recognition
- face pose
- lighting conditions
- human face recognition
- face representation and recognition
- input image
- face detection
- image set
- face tracking
- recognition rate
- feature selection
- face hallucination
- recognition algorithm
- recognizing faces