Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers.

Published in: AAAI (2021)

Keyphrases