Streaming Joint Speech Recognition and Disfluency Detection.
Hayato Futami, Emiru Tsunoo, Kentaro Shibata, Yosuke Kashiwagi, Takao Okuda, Siddhant Arora, Shinji Watanabe
Published in: CoRR (2022)
Keyphrases
- speech recognition
- noisy environments
- hidden Markov models
- speech synthesis
- language model
- pattern recognition
- automatic speech recognition
- speech processing
- speech recognizer
- speech signal
- speech recognition technology
- speaker identification
- voice activity detection
- speaker dependent
- speech recognition systems
- keyword spotting
- speech understanding
- multimedia
- information retrieval
- speaker diarization
- speech recognition errors
- computer vision
- audio visual speech recognition