FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency.
Rui LiuJiatian XiZiyue JiangHaizhou LiPublished in: CoRR (2023)
Keyphrases
- prosodic features
- speech synthesis
- text to speech
- speech recognition
- speaker verification
- speech recognition systems
- speech sounds
- acoustic features
- speech signal
- spontaneous speech
- audio visual
- automatic speech recognition
- synthesized speech
- vocal tract
- speaker independent
- emotional speech
- formant frequencies
- acoustic signal
- multi stream
- endpoint detection
- recognition engine
- multimedia
- acoustic models
- word processing
- speaker recognition