AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head.
Rongjie Huang, Mingze Li, Dongchao Yang, Jiatong Shi, Xuankai Chang, Zhenhui Ye, Yuning Wu, Zhiqing Hong, Jiawei Huang, Jinglin Liu, Yi Ren, Yuexian Zou, Zhou Zhao, Shinji Watanabe
Published in: AAAI (2024)
Keyphrases
- audio signals
- audio signal
- acoustic features
- audio features
- speech recognition
- endpoint detection
- digital audio
- speaker identification
- speech signal
- speaker recognition
- audio visual
- musical instruments
- audio recordings
- automatic speech recognition systems
- information retrieval
- audio content
- speech music discrimination
- music information retrieval
- computer music
- fundamental frequency
- music collections
- facial animation
- automatic speech recognition