SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training.
Ziqiang Zhang
Long Zhou
Junyi Ao
Shujie Liu
Lirong Dai
Jinyu Li
Furu Wei
Published in:
EMNLP (2022)
Keyphrases
text to speech synthesis
text to speech
text input
english text
text recognition
speech recognition
information retrieval
multi lingual
lexical features
spontaneous speech
hearing impaired
spoken documents
audio visual
text mining
keywords
video codec
speech signal
language generation
computational complexity