Login / Signup
Multimodal active speaker detection and virtual cinematography for video conferencing.
Ross Cutler
Ramin Mehran
Sam Johnson
Cha Zhang
Adam Kirk
Oliver Whyte
Adarsh Kowdle
Published in:
CoRR (2020)
Keyphrases
</>
video conferencing
audio visual
video compression
distance learning
multi modal
collaborative environments
image processing
collaborative learning
image quality
speech recognition
d scene
visual information
machine learning
multimedia
three dimensional
image sequences