Predicting audio-visual salient events based on visual, audio and text modalities for movie summarization.

Published in: ICIP (2015)

Keyphrases