SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training.
Ziqiang Zhang
Long Zhou
Junyi Ao
Shujie Liu
Lirong Dai
Jinyu Li
Furu Wei
Published in: CoRR (2022)
Keyphrases
text to speech synthesis
text to speech
lexical features
text recognition
multilingual
english text
speech recognition
text input
information retrieval
language generation
spontaneous speech
text mining
motion estimation
motion vectors
training set
Wyner-Ziv video coding