Automatic Speech-to-Speech Translation of Educational Videos Using SeamlessM4T and Its Use for Future VR Applications.
Lucas Rafael Stefanel GrisDiogo FernandesFrederico Santos de OliveiraPublished in: VR Workshops (2024)
Keyphrases
- speech recognition
- speech synthesis
- text to speech
- speech music discrimination
- endpoint detection
- virtual reality
- speech signal
- recognition engine
- audio visual
- semi automatic
- dialogue system
- spoken language
- speaker recognition
- speech transcripts
- video analysis
- key frames
- information retrieval
- fully automatic
- information extraction
- speech corpus
- long term
- video sequences