Publication: Vision talks: Visual relationship-enhanced transformer for video-guided machine translation.