Balanced SNR-Aware Distillation for Guided Text-to-Audio Generation.
Bingzhi LiuYin CaoHaohe LiuYi ZhouPublished in: CoRR (2023)
Keyphrases
- text graphics
- text generation
- signal to noise ratio
- database
- multimedia
- free text
- information retrieval
- generation process
- noise reduction
- keywords
- text mining
- multimedia documents
- cross media retrieval
- feature selection
- human language
- text to speech
- text information
- natural language generation
- audio visual
- visual data
- text retrieval
- key concepts
- multi modal
- natural language
- hidden markov models
- signal processing
- audio content
- visual information
- web documents