End-to-end Speech Translation via Cross-modal Progressive Training.
Rong Ye
Mingxuan Wang
Lei Li
Published in:
CoRR (2021)
Keyphrases
end to end
cross modal
multi modal
congestion control
image retrieval
multimedia retrieval
visual recognition
multimedia databases
information retrieval
similarity measure
visual data
image sequences
training set
data management
video data