Publication: Beyond Frame-level CNN: Saliency-Aware 3-D CNN With LSTM for Video Action Recognition.