OWL (Observe, Watch, Listen): Localizing Actions in Egocentric Video via Audiovisual Temporal Context.

Published in: CoRR (2022)

Keyphrases