Estimation of speaker age and height from speech signal using bi-encoder transformer mixture model.

Published in: INTERSPEECH (2022)

Keyphrases