Jointly Recognizing Speech and Singing Voices Based on Multi-Task Audio Source Separation.
Ye BaiChenxing LiHao LiYuanyuan ZhaoXiaorui WangPublished in: CoRR (2024)
Keyphrases
- audio features
- multi task
- source separation
- audio visual
- low level
- feature set
- learning tasks
- visual features
- multi class
- music information retrieval
- speaker identification
- feature selection
- audio signal
- text data
- multi modal
- music retrieval
- visual information
- learning problems
- transfer learning
- sound source
- high dimensional
- data mining
- model selection
- support vector
- multimedia
- information retrieval