Exploring the multimodal information from video content using deep learning features of appearance, audio and action for video recommendation.

Published in: CoRR (2020)

Keyphrases