Sign in

Towards an Interpretable Representation of Speaker Identity via Perceptual Voice Qualities.

Robin NetzorgBohan YuAndrea GuzmanPeter WuLuna McNultyGopala Anumanchipalli
Published in: CoRR (2023)
Keyphrases
  • real time
  • low level
  • speech recognition
  • learning algorithm
  • social networks
  • audio visual
  • speaker recognition
  • perceptual information