Advanced Long-Content Speech Recognition With Factorized Neural Transducer.
Xun GongYu WuJinyu LiShujie LiuRui ZhaoXie ChenYanmin QianPublished in: IEEE ACM Trans. Audio Speech Lang. Process. (2024)
Keyphrases
- speech recognition
- hidden markov models
- pattern recognition
- language model
- speech synthesis
- speech recognizer
- speech processing
- speech recognition systems
- automatic speech recognition
- neural network
- speech signal
- speech recognition technology
- speaker identification
- speech understanding
- keyword spotting
- noisy environments
- handwriting recognition
- multimedia content
- speaker independent
- speech retrieval
- isolated word
- speaker dependent
- signal processing
- multimedia
- data mining