Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational (RAMC) Speech Dataset.
Zehui Yang, Yifan Chen, Lei Luo, Runyan Yang, Lingxuan Ye, Gaofeng Cheng, Ji Xu, Yaohui Jin, Qingqing Zhang, Pengyuan Zhang, Lei Xie, Yonghong Yan
Published in: INTERSPEECH (2022)
Keyphrases
- open source
- speech recognition
- broadcast news
- spoken language
- emotion recognition
- conversational speech
- open source software
- multi modal
- source code
- case study
- prosodic features
- synthetic datasets
- automatic speech recognition
- manually annotated
- speech signal
- audio visual
- pattern recognition
- high level
- conversational agent
- recognition engine
- spoken document retrieval
- speech recognizer
- ground truth labels
- real world
- annotated images
- text to speech
- speaker identification
- feature extraction