Optimizing Latency for Online Video Captioning Using Audio-Visual Transformers.

Chiori Hori, Takaaki Hori, Jonathan Le Roux
Published in: Interspeech (2021)