Login / Signup
A Spatial Long-Term Iterative Mask Estimation Approach for Multi-Channel Speaker Diarization and Speech Recognition.
Feng Ma
Yanhui Tu
Maokui He
Ruoyu Wang
Shutong Niu
Lei Sun
Zhongfu Ye
Jun Du
Jia Pan
Chin-Hui Lee
Published in:
ICASSP (2024)
Keyphrases
</>
multi channel
speaker diarization
speech recognition
single channel
hidden markov models
pattern recognition
speaker identification
language model
speech signal
automatic speech recognition
handwriting recognition
speech recognition systems
noisy environments
computer vision
multi modal