TASTA: Text-Assisted Spatial and Temporal Attention Network for Video Question Answering.

Published in: Adv. Intell. Syst. (2023)

Keyphrases