Learning to combine the modalities of language and video for temporal moment localization.
Jungkyoo Shin, Jinyoung Moon
Published in: Comput. Vis. Image Underst. (2022)
Keyphrases
- learning process
- learning algorithm
- object oriented programming
- spatio temporal
- video sequences
- language learning
- interactive video
- knowledge acquisition
- combining multiple
- learning tasks
- spatial and temporal
- language acquisition
- learning systems
- online learning
- real time
- natural language
- java programming
- spatial temporal
- video clips
- space time
- active learning
- prior knowledge