Dynamic self-attention with vision synchronization networks for video question answering.

Published in: Pattern Recognit. (2022)

Keyphrases