Learning to combine the modalities of language and video for temporal moment localization.

Jungkyoo Shin Jinyoung Moon

Published in: Comput. Vis. Image Underst. (2022)

Keyphrases

learning process
learning algorithm
object oriented programming
spatio temporal
video sequences
language learning
interactive video
knowledge acquisition
combining multiple
learning tasks
spatial and temporal
language acquisition
learning systems
online learning
real time
natural language
java programming
spatial temporal
video clips
space time
active learning
prior knowledge