Video Captioning Using Attention Based Visual Fusion with Bi-temporal Context and Bi-modal Semantic Feature Learning.
Noorhan K. Fawzy
Mohammed A. Marey
Mostafa M. Aref
Published in:
AISI (2020)
Keyphrases
temporal context
visual features
temporal information
semantic features
spatio temporal
prior knowledge
video data
machine learning
video sequences
low level
low level features
audio visual