Fine-Tuned Self-Supervised Speech Representations for Language Diarization in Multilingual Code-Switched Speech.
Geoffrey T. Frost, Emily Morris, Joshua Jansen van Vüren, Thomas Niesler
Published in: CoRR (2023)
Keyphrases
- text to speech
- spoken language
- speech recognition
- speech signal
- text to speech synthesis
- speaker identification
- language acquisition
- fine tuned
- english text
- multi lingual
- automatic speech recognition
- broadcast news
- speech synthesis
- audio visual
- digital libraries
- language resources
- speaker diarization
- natural language
- source code
- language learning
- programming language
- hidden markov models
- open source
- noisy environments
- dialogue system
- modeling language
- multilingual documents
- spoken dialog systems