Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces.
Yu-An Chung
Wei-Hung Weng
Schrasing Tong
James R. Glass
Published in:
NeurIPS (2018)
Keyphrases
cross modal
multi modal
multimedia retrieval
text retrieval
semi supervised
text mining
information retrieval
visual data
perceptual information
multimedia databases
visual recognition
image retrieval
visual similarity
topic models
image database
natural language processing
supervised learning
keywords