Login / Signup

DINO-VITS: Data-Efficient Noise-Robust Zero-Shot Voice Cloning via Multi-Tasking with Self-Supervised Speaker Verification Loss.

Vikentii PankovValeria ProninaAlexander KuzminMaksim BorisovNikita UsoltsevXingshan ZengAlexander GolubkovNikolai ErmolenkoAleksandra ShirshovaYulia Matveeva
Published in: CoRR (2023)
Keyphrases
  • noisy environments
  • data processing
  • image processing
  • data analysis
  • image data
  • genetic algorithm
  • pattern recognition
  • data points
  • speech recognition
  • cost effective