Advanced Long-Content Speech Recognition With Factorized Neural Transducer.
Xun GongYu WuJinyu LiShujie LiuRui ZhaoXie ChenYanmin QianPublished in: CoRR (2024)
Keyphrases
- speech recognition
- hidden markov models
- speech processing
- speech synthesis
- pattern recognition
- speech recognizer
- automatic speech recognition
- speech recognizers
- language model
- speech understanding
- speech recognition technology
- noisy environments
- speaker identification
- speech recognition systems
- neural network
- speech signal
- multimedia content
- computer vision
- keyword spotting
- speech recognition errors
- speaker dependent
- speaker recognition
- speech retrieval
- speaker adaptation
- speaker independent
- multimedia
- information retrieval