Login / Signup
DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation.
Yongxin Zhu
Zhujin Gao
Xinyuan Zhou
Zhongyi Ye
Linli Xu
Published in:
EMNLP (2023)
Keyphrases
</>
diffusion model
speech recognition
speech signal
broadcast news
anisotropic diffusion
feature extraction
feature vectors
image registration
natural language processing
machine translation
influence maximization