Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs.

Published in: CoRR (2022)

Keyphrases