On the Benefits of Self-supervised Learned Speech Representations for Predicting Human Phonetic Misperceptions.
Santiago CuervoRicard MarxerPublished in: INTERSPEECH (2023)
Keyphrases
- speech recognition
- language acquisition
- automatic speech recognition
- higher level
- speech recognizer
- human communication
- speech signal
- spoken document retrieval
- spoken term detection
- human interaction
- human experts
- emotional speech
- audio visual
- human subjects
- dialogue system
- noisy environments
- computational models
- speaker identification
- artificial intelligence