AFL-Net: Integrating Audio, Facial, and Lip Modalities with Cross-Attention for Robust Speaker Diarization in the Wild.

Yongkang Yin, Xu Li, Ying Shan, Yuexian Zou
Published in: CoRR (2023)