Publication: Video captioning with recurrent networks based on frame- and video-level features and visual content classification.