Direct Vs Cascaded Speech-to-Speech Translation Using Transformer.
Lalaram Arya, Amartya Chowdhury, S. R. Mahadeva Prasanna
Published in: SPECOM (2) (2023)
Keyphrases
- speech recognition
- speech synthesis
- speech signal
- speaker identification
- spoken language
- data sets
- recognition engine
- broadcast news
- endpoint detection
- text to speech
- machine learning
- neural network
- emotion recognition
- automatic speech recognition
- power system
- speech processing
- face detection
- speech recognizer
- vocal tract