Publication: Temporal U-Nets for Video Summarization with Scene and Action Recognition.