Detach and Enhance: Learning Disentangled Cross-modal Latent Representation for Efficient Face-Voice Association and Matching.

Published in: ICDM (2022)

Keyphrases