Login / Signup
CPT: Cross-Modal Prefix-Tuning for Speech-To-Text Translation.
Yukun Ma
Trung Hieu Nguyen
Bin Ma
Published in:
ICASSP (2022)
Keyphrases
</>
cross modal
multi modal
multimedia retrieval
visual data
data structure
visual recognition
perceptual information
visual similarity
search engine
image retrieval