Publication: How to select a good training-data subset for transcription: submodular active selection for sequences.