Login / Signup
NeXt-TDNN: Modernizing Multi-Scale Temporal Convolution Backbone for Speaker Verification.
Hyunjun Heo
Ui-Hyeop Shin
Ran Lee
Youngju Cheon
Hyung-Min Park
Published in:
ICASSP (2024)
Keyphrases
</>
speaker verification
multiscale
noisy environments
speaker recognition
image processing
prosodic features
spatio temporal
emotion recognition
audio visual
multilayer perceptron
language identification
edge detection
image representation
face verification
probabilistic model
image segmentation
neural network