Direct Vs Cascaded Speech-to-Speech Translation Using Transformer.

Lalaram Arya, Amartya Chowdhury, S. R. Mahadeva Prasanna

Published in: SPECOM (2) (2023)

Keyphrases

speech recognition
speech synthesis
speech signal
speaker identification
spoken language
data sets
recognition engine
broadcast news
endpoint detection
text to speech
machine learning
neural network
emotion recognition
automatic speech recognition
power system
speech processing
face detection
speech recognizer
vocal tract