Amphion: An Open-Source Audio, Music and Speech Generation Toolkit.
Xueyao Zhang, Liumeng Xue, Yuancheng Wang, Yicheng Gu, Xi Chen, Zihao Fang, Haopeng Chen, Lexiao Zou, Chaoren Wang, Jun Han, Kai Chen, Haizhou Li, Zhizheng Wu
Published in: CoRR (2023)
Keyphrases
- audio signals
- open source
- speech music discrimination
- audio features
- audio recordings
- digital audio
- audio visual
- audio signal
- music genre classification
- music information retrieval
- audio stream
- music score
- acoustic features
- speaker identification
- gaussian mixture model
- music retrieval
- speech corpus
- automatic music genre classification
- broadcast news
- source code
- emotion recognition
- visual features
- open source software
- hidden markov models
- low level
- genre classification
- case study
- digital video
- cepstral features
- music scores
- multi modal
- spoken documents
- feature set
- text to speech
- audio content
- polyphonic music
- audio files
- automatic speech recognition
- multi stream
- multimedia
- metadata