Cross-Attentional Spatio-Temporal Semantic Graph Networks for Video Question Answering.

Published in: IEEE Trans. Image Process. (2022)

Keyphrases