Multimodal active speaker detection and virtual cinematography for video conferencing.

Published in: CoRR (2020)