Semantic retrieval of personal photos using a deep autoencoder fusing visual features with speech annotations represented as word/paragraph vectors.

Published in: INTERSPEECH (2015)

Keyphrases