Estimation of speaker age and height from speech signal using bi-encoder transformer mixture model.
Tarun GuptaDuc-Tuan TruongTran The AnhChng Eng SiongPublished in: CoRR (2022)
Keyphrases
- mixture model
- speech signal
- gaussian mixture model
- speaker recognition
- speech recognition
- speaker identification
- density estimation
- automatic speech recognition
- language model
- vocal tract
- automatic speech recognition systems
- em algorithm
- mel frequency cepstral coefficients
- acoustic features
- probabilistic model
- generative model
- model selection
- maximum likelihood
- noisy environments
- unsupervised learning
- expectation maximization
- probability density function
- speaker verification
- bit rate
- maximum likelihood estimation
- hidden markov models
- bayesian information criterion
- pattern recognition
- machine learning
- probabilistic mixture model
- feature vectors
- object recognition
- learning algorithm
- information retrieval