Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale.
Matthew Le, Apoorv Vyas, Bowen Shi, Brian Karrer, Leda Sari, Rashel Moritz, Mary Williamson, Vimal Manohar, Yossi Adi, Jay Mahadeokar, Wei-Ning Hsu
Published in: CoRR (2023)
Keyphrases
- text generation
- multi lingual
- natural language generation
- text to speech synthesis
- text to speech
- natural language
- english text
- text input
- text mining
- scale space
- speech recognition
- dialogue system
- language independent
- speech signal
- lexical features
- information access
- language generation
- cross lingual
- free text
- text recognition
- probabilistic model
- multilingual documents
- information retrieval
- database
- cross language
- natural language processing
- information extraction
- spontaneous speech
- digital libraries