Login / Signup

CPT: Cross-Modal Prefix-Tuning for Speech-To-Text Translation.

Yukun MaTrung Hieu NguyenBin Ma
Published in: ICASSP (2022)
Keyphrases
  • cross modal
  • multi modal
  • multimedia retrieval
  • visual data
  • data structure
  • visual recognition
  • perceptual information
  • visual similarity
  • search engine
  • image retrieval