Video Captioning Using Attention Based Visual Fusion with Bi-temporal Context and Bi-modal Semantic Feature Learning.

Noorhan K. Fawzy, Mohammed A. Marey, Mostafa M. Aref
Published in: AISI (2020)
Keyphrases
  • temporal context
  • visual features
  • temporal information
  • semantic features
  • spatio temporal
  • prior knowledge
  • video data
  • machine learning
  • video sequences
  • low level
  • low level features
  • audio visual