Login / Signup
DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation.
Yongxin Zhu
Zhujin Gao
Xinyuan Zhou
Zhongyi Ye
Linli Xu
Published in:
CoRR (2023)
Keyphrases
</>
diffusion model
speech recognition
multiscale
speech signal
information systems
moving objects
social media
image data
diffusion process