Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection.

Kyle MinSourya RoySubarna TripathiTanaya GuhaSomdeb Majumdar
Published in: ECCV (35) (2022)