Publication: Using audio and visual cues for speaker diarisation initialisation.