Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor.
Zhengyang Chen, Bing Han, Shuai Wang, Yanmin Qian
Published in: INTERSPEECH (2023)
Keyphrases
- end to end
- speaker diarization
- rate allocation
- congestion control
- internet protocol
- network architecture
- transport layer
- speech recognition
- low complexity
- video codec
- motion estimation
- neural network
- broadcast news
- source coding
- speaker identification
- rate control
- error control
- distributed video coding
- rate distortion