Speech Transformer with Speaker Aware Persistent Memory.
Yingzhu ZhaoChongjia NiCheung-Chi LeungShafiq R. JotyEng Siong ChngBin MaPublished in: INTERSPEECH (2020)
Keyphrases
- speech recognition
- audio visual
- speaker recognition
- automatic speech recognition
- speaker verification
- speaker identification
- prosodic features
- vocal tract
- speaker dependent
- speech signal
- speech synthesis
- speaker diarization
- automatic speech recognition systems
- memory usage
- fuzzy logic
- acoustic features
- memory requirements
- speaker adaptation
- multi modal
- text to speech
- speech sounds
- computing power
- noisy environments
- emotion recognition
- gaussian mixture model
- synthesized speech
- speaker independent
- spoken language
- speech recognizer
- main memory
- memory size
- limited memory
- power system
- recognition engine
- decision making
- hidden markov models
- data structure
- multimedia