Discrete Cross-Modal Alignment Enables Zero-Shot Speech Translation.
Chen Wang
Yuchen Liu
Boxing Chen
Jiajun Zhang
Wei Luo
Zhongqiang Huang
Chengqing Zong
Published in: CoRR (2022)
Keyphrases
cross-modal
multi-modal
visual data
image retrieval
machine translation
visual recognition
multimedia retrieval
perceptual information
multimedia
multimedia databases
information retrieval
e-learning
dimensionality reduction
visual features