Publication: Divide and Conquer: Question-Guided Spatio-Temporal Contextual Attention for Video Question Answering.