Publication: Zero-Shot Action Recognition with Transformer-based Video Semantic Embedding.