X-TF-GridNet: A time-frequency domain target speaker extraction network with adaptive speaker embedding fusion.
Fengyuan HaoXiaodong LiChengshi ZhengPublished in: Inf. Fusion (2024)
Keyphrases
- speaker verification
- speech recognition
- speaker recognition
- audio visual
- data fusion
- speaker identification
- network traffic
- complex networks
- wireless sensor networks
- automatic speech recognition
- domain specific
- network structure
- network model
- neural network
- frequency domain
- domain independent
- multi modal
- signal processing
- computer networks
- peer to peer
- cross domain
- co occurrence
- wavelet transform