Cascade Cross-modal Attention Network for Video Actor and Action Segmentation from a Sentence.

Published in: ACM Multimedia (2021)

Keyphrases