Can we Automatically Transform Speech Recorded on Common Consumer Devices in Real-World Environments into Professional Production Quality Speech? - A Dataset, Insights, and Challenges.
Gautham J. MysorePublished in: IEEE Signal Process. Lett. (2015)
Keyphrases
- speech recognition
- speech synthesis
- text to speech
- automatic speech recognition
- speech signal
- spoken language
- endpoint detection
- broadcast news
- quality control
- emotion recognition
- speaker recognition
- pattern recognition
- recognition engine
- speech quality
- service quality
- synthetic datasets
- automatically generated
- data quality
- electronic commerce
- information technology
- speaker diarization
- hands free
- information retrieval