DocLayoutTTS: Dataset and Baselines for Layout-informed Document-level Neural Speech Synthesis.
Puneet Mathur, Franck Dernoncourt, Quan Hung Tran, Jiuxiang Gu, Ani Nenkova, Vlad I. Morariu, Rajiv Jain, Dinesh Manocha
Published in: INTERSPEECH (2022)
Keyphrases
- document level
- speech synthesis
- speech recognition
- language model
- sentence level
- text to speech
- query expansion
- sentiment classification
- vocal tract
- neural network
- document retrieval
- coreference resolution
- machine learning
- probabilistic model
- pseudo relevance feedback
- information retrieval
- language modeling
- sentiment analysis
- retrieval model
- pattern recognition