Separator-Transducer-Segmenter: Streaming Recognition and Segmentation of Multi-party Speech.
Ilya SklyarAnna PiunovaChristian OsendorferPublished in: INTERSPEECH (2022)
Keyphrases
- multi party
- recognition engine
- human communication
- word segmentation
- privacy preserving
- image parsing
- recognition rate
- numeral strings
- object recognition
- level set
- segmentation algorithm
- image segmentation
- speech recognition
- segmentation method
- word recognition
- medical images
- automatic speech recognition systems
- recognition algorithm
- data streams
- noisy environments
- speech corpus
- feature extraction
- handwriting recognition
- document analysis
- character recognition
- hidden markov models
- automatic speech recognition
- description language
- video streaming
- audio video
- audio visual
- speech recognition systems
- intelligent systems
- speech sounds
- automatic transcription
- cooperative