Towards Real-World Streaming Speech Translation for Code-Switched Speech.
Belen Alastruey, Matthias Sperber, Christian Gollan, Dominic Telaar, Tim Ng, Aashish Agarwal
Published in: CoRR (2023)
Keyphrases
- speech recognition
- real world
- speech signal
- pattern recognition
- endpoint detection
- vocal tract
- speech synthesis
- audio visual
- automatic speech recognition systems
- language acquisition
- speaker recognition
- recognition engine
- data streams
- text to speech synthesis
- emotion recognition
- automatic speech recognition
- linear prediction
- text to speech
- speech processing
- source code
- open source
- spontaneous speech
- audio stream
- data sets