VCVTS: Multi-speaker Video-to-Speech synthesis via cross-modal knowledge transfer from voice conversion.
Disong Wang, Shan Yang, Dan Su, Xunying Liu, Dong Yu, Helen Meng. Published in: CoRR (2022)
Keyphrases
- speech synthesis
- knowledge transfer
- cross-modal
- prosodic features
- speech recognition
- text-to-speech
- vocal tract
- multi-modal
- visual data
- knowledge sharing
- video sequences
- transfer learning
- multimedia retrieval
- semantic concepts
- multimedia
- audio-visual
- video analysis
- video streams
- image retrieval
- video frames
- video data
- language model
- visual recognition
- speaker verification
- learning tasks
- multimedia databases
- multimedia data
- video content
- automatic speech recognition
- visual similarity
- pattern recognition
- image processing
- data sets
- visual information
- hidden Markov models
- word processing
- image sequences
- visual content
- speech signal
- information extraction
- object recognition