Publication: A Case Study on Combining ASR and Visual Features for Generating Instructional Video Captions.